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(57)Abstract: 

PROBLEM TO BE SOLVED: To make information 
retrieval results provided by a retrieval engine to be more 
appropriate to a retrieval target of an information 
retriever. 

SOLUTION: A character enhancement analyzing part 
124 acquires an attribute showing enhancement given to 
a character string included in a web page 20. An index 
registering part 126 associates logical position 
information of the web page 20 and an enhancement 
attribute about a character string being an extraction 
source of a word with the word extracted from the 
character string by a character string analyzing part 125 
to register them in an index file 310. An information 
retrieving part 210 acquires position information 
associated with a word representing a retrieval target. A retrieval result managing part 220 
sorts the acquired position information to make a word with an enhancement attribute 
associated to take precedence with respect to a word made to be an object of retrieval of the 
position information. An HTML preparing part 410 prepares an HTML file representing the 
sorted position information and transmits the HTML file to the Internet 4. 
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LThis document has been translated by computer. So the translation may not reflect the 
original precisely. 

2.**** shows the word which can not be translated. 
3.ln the drawings, any words are not translated. 



[Claim(s)] 

[Claim 1]A word contained in document information characterized by comprising the following 
currently released on a communication network, An index file which matches position 
information which shows a document information position in which it is the information which 
shows a logical position on this communication network, and information containing this word 
exists is prepared, A system which presents position information corresponding to a word 
which searches this index file based on a word showing an object of search, and expresses 
this retrieval object. 

An emphasis-attributes acquisition means which acquires emphasis attributes which are 
attributes given to a character string contained in said document information, and show 
emphasis. 

An extraction means to extract a word from said character string. 

A registration means to match said position information about this word, and said emphasis 
attributes given to a character string of extraction origin of this word with a word extracted by 
said extraction means, and to register with said index file. 

A search means to acquire position information which searches said index file based on a 
word showing said retrieval object, and is matched with this word from this index file, A 
presenting means which gives priority to that by which said emphasis attributes are matched 
with a word which this search means made an object of search of this position information in 
said index file among position information acquired by said search means, and presents this 
position information. 

[Claim 2]The information retrieval system according to claim 1 , wherein an attribute given to a 
character string contained in said document information is an attribute which shows color of a 
character used in order to display an attribute which shows a size of a character used when 
displaying this character string, or this character string. 
[Claim 3]The information retrieval system comprising according to claim 1 or 2: 
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A frequency-of-occurrence calculating means which computes the frequency of occurrence in 
this document information about an attribute given to a character string by which said 
emphasis-attributes acquisition means is included in said document information for this every 
attribute. 

An emphasis-attributes setting-out means to set up a standard which distinguishes whether 
said attributes are said emphasis attributes based on the frequency of occurrence for this 
every attribute, and an emphasis-attributes discriminating means which distinguishes whether 
attributes given to a character string contained in said document information based on said 
standard are said emphasis attributes. 

[Claim 4]the information retrieval system according to claim 1 which is rich and is characterized 
by a thing which is said emphasis attributes about this attribute, and to make when said 
emphasis-attributes acquisition means shows that a character used in order that an attribute 
given to a character string contained in said document information may display this character 
string is made into a bold letter. 

[Claim 5]lnside of position information from which said presenting means was acquired by said 
search means, The information retrieval system according to claim 1 giving priority to a thing 
which has many number of said emphasis attributes matched with a word which this search 
means made an object of search of this position information in said search file, and showing 
this position information. 

[Claim 6]An index file which matches position information which shows a document information 
position characterized by comprising the following in which information exists is prepared, It is 
the method of showing position information corresponding to a word which searches this index 
file based on a word showing an object of search, and expresses this retrieval object, 
Emphasis attributes which are attributes given to a character string contained in said document 
information, and show emphasis are acquired, Match said position information about this word, 
and said emphasis attributes given to a character string of extraction origin of this word with a 
word which extracted a word from said character string and was extracted from said character 
string, and it registers with said index file, Position information which searches said index file 
based on a word showing said retrieval object, and is matched with this word is acquired from 
this index file, An information retrieval method characterized by what priority is given to that by 
which said emphasis attributes are matched with a word made into an object of search of this 
position information in said search file among position information acquired by said search, 
and this position information is shown for. 

A word contained in document information currently released on a communication network. 
It is the information which shows a logical position on this communication network, and is this 
word. 

[Claim 7]By performing a computer, Processing which prepares an index file which matches 
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position information which shows a document information position in which it is the information 
which shows a logical position on a word contained in document information currently released 
on a communication network, and this communication network, and information containing this 
word exists, It is a program for making processing which presents position information 
corresponding to a word which searches this index file based on a word showing an object of 
search, and expresses this retrieval object perform to this computer, Processing which 
acquires emphasis attributes which are attributes given to a character string contained in said 
document information, and show emphasis, Processing which matches processing which 
extracts a word from said character string, and said emphasis attributes given to a character 
string of said position information [ about this word ], and extraction origin of this word at a 
word extracted from said character string, and is registered into said index file, Inside of 
processing which acquires position information which searches said index file based on a word 
showing said retrieval object, and is matched with this word from this index file, and position 
information acquired by said search, A program for making processing which gives priority to 
that by which said emphasis attributes are matched with a word made into an object of search 
of this position information in said search file, and presents this position information perform to 
a computer. 



[Translation done.] 
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LThis document has been translated by computer. So the translation may not reflect the 
original precisely. 

2.**** shows the word which can not be translated. 
3.ln the drawings, any words are not translated. 



[Detailed Description of the Invention] 
[0001] 

[Field of the lnvention]This invention relates to the art of enabling it to provide the information 
which agreed appropriately by the demand, to the demand of search especially about the art of 
retrieving information. 
[0002] 

[Description of the Prior Art]ln recent years, the number of the Web pages provided by the 
WWW (WorldWideWeb) system on the Internet is continuing increasing explosively by the 
spread of the Internet. On the Internet, many search engines which provide the service which 
retrieves the information made into the purpose out of this huge information are established. 
[0003]There are some which are called the robot type as one of the methods which collects the 
information on a network of a search engine. In a robot type search engine, the robot program 
called a spider or a crawler is started periodically, Automatic collection of the HTML 
(HyperText Markup Language) file expressing the Web page currently exhibited on the Internet 
is performed. When information retrieval is performed and the information retrieval person 
using a search engine gives a closely related keyword to the target information at a search 
engine site, Processing which extracts that in which the keyword was contained from the 
collected file is performed, and an information retrieval person is provided with the list of Web 
pages which are the keyword and which are contained as search results with the information 
which shows the logical position on the Internet about the Web page. 
[0004] 

[Problem(s) to be Solved by the lnvention]Generally, since the robot type search engine is 
performing [ no ] automatically processings of a to [ from collection of information / offer of 
search results ] by computer and operation of the information by judgment of human being 
intervenes there, The arrangement about the quality of the genre to which the collected 
information belongs, or its information is not made. Therefore, if search by coincidence of a 
mere keyword was performed on the occasion of search of information, a Web page including 
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important information is buried in search results, or. Or there were not few cases where it will 

be mostly contained in search results only in about the Web page in which what is called a 

search noise, i.e., the low information on usefulness, is contained. 

[0005]Making more suitable the result of the information retrieval which a search engine 

provides in view of the above problem to an information retrieval person's retrieval object is the 

issue which this invention tends to solve. 

[0006] 

[Means for Solving the ProblemJA word contained in document information to which this 
invention is opened on a communication network, An index file which matches position 
information which shows a document information position in which it is the information which 
shows a logical position on this communication network, and information containing this word 
exists is prepared, It is premised on a system or a method of showing position information 
corresponding to a word which searches this index file based on a word showing an object of 
search, and expresses this retrieval object. 

[0007]And an information retrieval system which is one of the modes of this invention, An 
emphasis-attributes acquisition means which acquires emphasis attributes which are attributes 
given to a character string contained in said document information, and show emphasis, In a 
word extracted by extraction means to extract a word from said character string, and said 
extraction means. A registration means to match said position information about this word, and 
said emphasis attributes given to a character string of extraction origin of this word, and to 
register with said index file, A search means to acquire position information which searches 
said index file based on a word showing said retrieval object, and is matched with this word 
from this index file, A presenting means which gives priority to that by which said emphasis 
attributes are matched with a word which this search means made an object of search of this 
position information in said index file among position information acquired by said search 
means, and presents this position information, SUBJECT mentioned above is solved by 
constituting so that it may ****. 

[0008]An attribute given to a character string contained in said document information here is 
an attribute which shows color of a character used in order to display an attribute which shows 
a size of a character used when displaying this character string, for example, or this character 
string. It is possible that document information containing a word emphasis is instructed to be 
has a high possibility that information that importance is high is included about the word. 
Therefore, when two or more document information corresponding to a word which shows a 
search condition is opened to a communication network according to composition mentioned 
above, Since the word is emphasized, and priority is given to position information about 
document information considered that importance is high, it makes and it is shown, a result of 
information retrieval will become more suitable to an information retrieval person's retrieval 
object. 

[0009]ln an information retrieval system concerning this invention mentioned above, said 
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emphasis-attributes acquisition means, Afrequency-of-occurrence calculating means which 
computes the frequency of occurrence in this document information about an attribute given to 
a character string contained in said document information for this every attribute, An emphasis- 
attributes setting-out means to set up a standard which distinguishes whether said attributes 
are said emphasis attributes based on the frequency of occurrence for this every attribute, It 
may constitute so that it may have an emphasis-attributes discriminating means which 
distinguishes whether attributes given to a character string contained in said document 
information are said emphasis attributes based on said standard. 
[0010]Since it can judge now that a character string to which a unique attribute is given in 
document information is emphasized in that document information according to this 
composition, this document information can be registered into an index file for a word 
contained in this character string as what has high importance. 

[0011]ln an information retrieval system concerning this invention mentioned above, said 
emphasis-attributes acquisition means, When making into a bold letter a character used in 
order that an attribute given to a character string contained in said document information may 
display this character string is shown, it may be made to consider that these attributes are said 
emphasis attributes. 

[001 2] According to this composition, in order to show emphasis of a character string in 
document information, the attribute of a purport that a display by a bold letter currently 
performed widely is performed can be promptly judged with emphasis attributes. In an 
information retrieval system concerning this invention mentioned above, said presenting 
means, Priority is given to a thing which has many number of said emphasis attributes 
matched with a word which this search means made an object of search of this position 
information in said search file among position information acquired by said search means, and 
it may be made to show this position information. 

[0013]According to this composition, for a word by which a thing which has many number of 
emphasis attributes given to a certain character string is contained in this character string, this 
document information can be registered now into an index file as what has higher importance, 
and a result of information retrieval will become still more suitable to an information retrieval 
person's retrieval object. 

[0014]An information retrieval method which is one of the another modes of this invention, 
Emphasis attributes which are attributes given to a character string contained in said document 
information, and show emphasis are acquired, Match said position information about this word, 
and said emphasis attributes given to a character string of extraction origin of this word with a 
word which extracted a word from said character string and was extracted from said character 
string, and it registers with said index file, Position information which searches said index file 
based on a word showing said retrieval object, and is matched with this word is acquired from 
this index file, The same operation and effect as an information retrieval system concerning 
this invention mentioned above are acquired by giving priority to that by which said emphasis 
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attributes are matched with a word made into an object of search of this position information in 
said search file among position information acquired by said search, and showing this position 
information. 

[0015]SUBJECT mentioned above by making a computer execute the program also in a 
program for making processing which consists of the same procedure as an information 
retrieval method concerning this invention mentioned above perform to a computer is solvable. 

[0016] 

[Embodiment of the lnvention]Hereafter, an embodiment of the invention is described based on 
a drawing. Drawing 1 is a figure in which the information retrieval site which carries out this 
invention shows the entire configuration of the communication network which provides an 
information search service. 

[0017]ln drawing 1 , the Internet 4 which is all a communication network is accessed, and data 
can be delivered [ the information retrieval site 1 the offer-of-information site 2a, 2b, 2c, 2d, 
and the user terminals 3a and 3b ] and received mutually. The information retrieval site 1 is a 
WWW server system which provides a robot search type information search service for the 
user terminals 3a and 3b, is provided with the Research and Data Processing Department 100, 
the information retrieval Management Department 200, the information database Management 
Department 300, and the WWW server Management Department 400, and is constituted. 
[0018]The Research and Data Processing Department 100 performs automatic collection of 
the information currently released on the Internet 4, and accumulates the collected information 
in the information database Management Department 300. The information retrieval 
Management Department 200 retrieves the information accumulated in the information 
database Management Department 300 according to the demand of the information retrieval 
sent via the Internet 4, and returns the result of the search to a requiring agency. 
[0019]At the information database Management Department 300, accumulation of the 
information collected by the Research and Data Processing Department 100 and search of the 
information by the information retrieval Management Department 200 are performed. The 
processing by which the WWW server part 400 transmits the collected information which is 
sent via the Internet 4 to the Research and Data Processing Department 100, Processing 
which transmits the demand of the information retrieval sent via the Internet 4 to the 
information retrieval Management Department 200, and processing of sending out of a Web 
page in which the information which shows the result of the information retrieval sent from the 
information retrieval Management Department 200 is expressed are performed. 
[0020]The offer-of-information site 2a, 2b, and 2c and 2d are WWW server systems which 
exhibit Web pages 20a, 20b, 20c, and 20d on the Internet 4, respectively. Although four offer- 
of-information sites are shown in drawing 1 , the number of the offer-of-information sites 
connected to the Internet 4 may be arbitrary. 

[0021 ]The user terminals 3a and 3b, respectively The offer-of-information site 2a, 2b, 2c, And it 
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is a computer which can perform the browsers 30a and 30b which are the software which 
peruses the Web page provided from 2 d or the information retrieval site 1, and is operated by 
the information retrieval person who requests search of the information currently released on 
the Internet 4 to the information retrieval site 1. Although two users are shown in drawi ng 1, the 
number of the user terminals connected to the Internet 4 may also be arbitrary. 
[0022]These information retrieval sites 1, the offer-of-information site 2a, 2b, 2c and 2d, and 
the user terminals 3a and 3b, The computer by which all have standard composition, i.e., CPU 
which controls each component by executing a control program, The storage parts store used 
as the work area at the time of the memory and CPU of a control program which consist of a 
ROM, RAM, a magnetic storage device, etc., and make CPU control each component 
executing a control program, or a storage area of various data, It can also constitute using a 
computer provided with the input part from which various kinds of data corresponding to 
operation by a user is acquired, the outputting part which show a display etc. various kinds of 
data and of which a user is notified, and the l/F part which provides the interface function for 
connecting with a network. 

[0023]Next, drawing 2 is explained. The figure shows the detailed composition of the 
information retrieval site 1 in drawing 1 which carries out this invention. As shown in drawing 2 , 
the Research and Data Processing Department 100 has the Web page collection Management 
Department 110 and the index production part 120, and is constituted, The information 
management retrieval part 200 is provided with the information retrieval section 210 and the 
search-results Management Department 220, and is constituted, The database manager 300 
has the index file 310, the search-results file 320, and the search-results list file 330, and is 
constituted, and WWW server 400 is provided with the HTML preparing part 410, and is 
constituted. . 

[0024]The Web page collection Management Department 110 performs periodically automatic 
collection of Web page 20 currently exhibited on the Internet 4. The position information on 
Web page 20 by which the index build Management Department 120 was collected by the 
Web page collection Management Department 110, That is, the record used as the index 
which can lengthen the position information which shows the logical position on the Internet 4 
in which Web page 20 exists is created, and it registers with the index file 310. The index build 
Management Department 120 has the Web page analyzing parts 121, the word extraction 
Management Department 123, and the index registering part 126, and is constituted. 
[0025]The Web page analyzing parts 121 create the HTML filter table 122 which makes the 
unit of a record each HTML tag described by the text of the HTML file which analyzes Web 
page 20 and is expressing Web page 20. processing which analyzes the conditions of the 
character style which is emphasized when they are displayed as a screen of Web page 20 in 
the character string shown in the HTML filter table 122, and which can be been [ character 
style / it ] rich and made being performed by the character emphasis analyzing parts 124, and 
at the word extraction Management Department 123, Analysis of the character string shown in 
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the HTML filter table 122 is conducted by the character string analyzing parts 125, and a word 
is extracted from the character string. 

[0026]The logical position information [ the index registering part 126 makes an entry the word 
extracted by the character string analyzing parts 125 and / entry / the ] on the Internet 4 about 
Web page 20, The index record in which the abstract of Web page 20 in which the word was 
contained, and the form set as the word by Web page 20 matched the attribute flag which 
shows that it agrees on the conditions of the character style obtained in the analysis in the 
character emphasis analyzing parts 124 is registered into the index file 310. 
[0027]The information retrieval section 210 acquires the demand of the information retrieval 
sent from the user terminal by control of the browser 30 currently performed with one of the 
user terminals connected to the Internet 4 from the WWW server part 400, The search formula 
showing the conditions of the information retrieval is taken out from the demand, and it stores 
in the search formula storage 21 1 . And the word (keyword) which searches the index file 310 
and is shown in the search formula acquires the index record used as a title, and stores in the 
search-results file 320. 

[0028]lf search by the information retrieval section 210 is completed, the search-results 
Management Department 220, The position information and the abstract which are shown in 
the index data stored in the search-results file 320, and the total number of the attribute flag 
attached with the index record corresponding to the position information are stored in the 
search-results list file 330. And the position information stored in the search-results list file 330 
is sorted according to the total number. 

[0029]The HTML preparing part 410 creates the HTML file expressing the Web page which 
receives the search-results list which consists of sorted position information which is stored in 
the search-results list file 330 and as which the search-results list is expressed. The created 
HTML file is addressed to the user terminal in which the browser 30 is performed, and is sent 
out to the Internet 4 by the WWW server part 400. 

[0030]Next, the details of processing of the collection of a Web page performed in the 
Research and Data Processing Department 100 which the information retrieval site 1 has, and 
generation of an index are explained. Dr awing 3 shows the example of Web page 20 which is 
opened to the Internet 4 and collected by the information retrieval site 1. In the figure, the 
browser's 30 inspection of the HTML sauce shown in (b) will display the screen shown in the 
figure (a). 

[0031] Drawing 4 is explained here. The figure is a flow chart which shows the contents of 
processing of the index production processing performed in the Research and Data 
Processing Department 100. By performing this processing, collection of a Web page and 
generation of an index are performed in the Research and Data Processing Department 100. 
First, in S101, only when it was distinguished at the Web page collection Management 
Department 1 10 whether the present date is the collection designated date of Web page 20 
specified beforehand, this decision result is set to Yes and the present becomes that 
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designated date, processing progresses to S102. Although the method of specification of this 
date is arbitrary, specification called the monthly final day of month end, etc. is performed, for 
example. 

[0032]ln S102, processing of a round and collection of Web page 20 currently exhibited on the 
Internet 4 is performed by the Web page collection Management Department 110. The 
technique of this round and collection should just use as it is what is performed from the former 
by the well-known robot type search engine. 

[0033]ln the Web page analyzing parts 121 , the initial value 1 is substituted for the variable m 
used as page pointers which are pointers for pointing at a time to 1 page of Web pages 20 of a 
large number collected at the front step S103. The structure of the page to which it points by 
the current value of the page pointers m in Web page 20 collected by processing of S102 in 
S104 is analyzed by the Web page analyzing parts 121 , The HTML filter table 122 is 
generated by the Web page analyzing parts 121 in S105 continuing. 
[0034]The HTML filter table generated from the Web page shown in drawing 3 is shown in 
drawing 5 . Analysis of the HTML sauce shown in drawing 3 (b) by the Web page analyzing 
parts 121 will generate the HTML filter table shown in drawing 5 . When it explains further, 
referring to drawing 3 (b) for the contents of processing of S104, in the Web page analyzing 
parts 121 . The text of the HTML sauce of an analytical object, i.e., the portion pinched between 
the start tag of <BODY> and the end tag, is made into the object of analysis, and the structure 
of each sentence where the <BR> tag (line feed tag) in the portion is considered as a pause of 
a sentence, and is contained in the text is analyzed. 

[0035]lf signs that the HTML filter table shown in drawing 5 from the HTML sauce shown in 
drawing 3 (b) is created are explained, First, let the portion pinched between the start tag of 
the <BODY> tag and end tag which are the description parts of the text in HTML sauce, i.e., 
the portion put between the <BODY> tag and the </BODY> tag, be an object of the analysis by 
processing of S104. 

[0036]here - first - an analytical object - a portion - it can set - the beginning - < - BR - > - 
- a tag - describing - having - **** - a part - up to - a portion - namely, - "- < - FONT 
SIZE - = - " - six - " - COLOR - = - " - # - FF - 0000 - " - > - < - B - > - easy - 
cooking - < - /-- B - > - < - /-- FONT - > - " - a portion is analyzed. <FONT 
SIZE="6"COLOR="#FF0000"> Here the becoming tag, the "easy dish" described by the 
portion pinched between the start tag of <FONT>, and the end tag - a character string - 
character size -- "6" - considering it as a size - and a character color -- "#FF0000" - what is 
displayed as a color shown numerically is shown. The color shown for this figure is red. 
[0037]Displaying the character string "the easy dish" <B> [ which is described by the portion by 
which the becoming tag is sandwiched between the start tag of <FONT> and the end tag ] 
Becoming in a bold letter is shown. The analysis processing by S104 conducts analysis 
mentioned above, and the record shown in the 1st line of d rawing 5 which means the contents 
of this analysis result is stored in the HTML page filter 122 by processing of S105 performed 
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next. An "easy dish" is stored in the column of a "character string" when this record is 
explained, "STRING" which shows that an "easy dish" is a character string is stored in the 
column of "classification", and the attribute which shows that "6" and a "color" are displayed for 
"character size" as "red" about the character string of an "easy dish" is stored in the column of 
a "character string attribute." By furthermore storing the flag "1" in the column of the "bold 
letter" in a "character string attribute" shows that displaying the character string of an "easy 
dish" in a bold letter was shown. And the record in which existence of the <BR> tag which only 
makes it the contents for "classification" to be "BR" after the is shown is stored in the 2nd line 
of the HTML page filter 122. 

[0038]Analysis of the remaining portion in the text part of the HTML sauce shown in drawing 3 
(b) is conducted similarly hereafter, and the HTML page filter 122 shown in drawing 5 in this 
way is generated. Although the SIZE attribute in the <FONT> tag may be shown like "+1", "-1", 
etc. by the relative value over the usual displayed character size, When such, the relative 
value is registered as character size, and about the character string by which specification of 
character size is not made, "0" is registered as character size. 

[0039]lt returns to explanation of drawing 4 and processing which analyzes the standard of the 
character string attribute performing setting out of the standard of an index attribute, i.e., 
highlighting, can consider that it is set up, and sets up the standard is performed by the 
character emphasis analyzing parts 124 in the character string stored in the HTML page filter 
122 in S106. The details of this processing are mentioned later. 

[0040]The initial value 1 is substituted for the variable n used as a filter pointer which is a 
pointer for specifying one [ at a time ] each line (record) of the HTML filter table 122 in order by 
the character string analyzing parts 125 S107. It is distinguished by the character string 
analyzing parts 125 whether the data in which the classification of the character string shown 
in the line specified by the filter pointer n mentioned above in S 108 is shown is "STRING", if 
the result of this distinction becomes in Yes, processing will progress to S109, and if No 
becomes, processing will progress to S115. 

[0041 ]ln S109, processing which sets an index attribute as the character string shown in the 
line specified by the filter pointer n based on the standard set up by processing of S 106 is 
performed by the character string analyzing parts 125. The details of this processing are also 
mentioned later. Then, in [ processing which starts a word from the character string shown in 
the line specified by the filter pointer n in S1 10 is performed by the character string analyzing 
parts 125, and ] S1 1 1, Processing from which the part of speech extracts the word which is a 
noun is continuously performed by the character string analyzing parts 125 from the started 
word. 

[0042]A well-known method is adopted as processing of logging of the word in S1 10. The 
method made into the word which used what is called a morphological analysis, for example, 
acquired the canonical form of that word for the part of speech and conjugated form of the 
word which were started as a method of this common knowledge using various kinds of 
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dictionaries, and started the word of that canonical form from the character string, There are 
what is called an N gram method etc. that start the word of length N mechanically in order 
shifting logging of a character string of one character at a time from the head of the character 
string. 

[0043]ln S1 12, it is distinguished by the character string analyzing parts 125 whether the word 
extracted by the processing of S1 1 1 mentioned above existed, if this decision result becomes 
in Yes, processing will progress to S1 13, and if No becomes, processing will progress to S1 15. 
The position information which is the page which made the title the word extracted by the 
processing of S1 1 1 mentioned above in S1 13, and in which the word was contained, The 
index which matched with the word of the title the abstract of the text indicated to the page and 
the character attribute set up by processing of S109 to the character string in which the word 
was contained is generated by the index registering part 126, The index generated by 
processing of Scontinuing 1 14 is registered into the index file 310. 
[0044]ln S115, directions of the filter pointer n mentioned above by the character string 
analyzing parts 125 are advanced only 1. It is distinguished by the character string analyzing 
parts 125 whether in S1 16, the line specified by the present numerical value of the filter pointer 
n has exceeded the last line that exists in the HTML filter table 122, If this discriminated result 
becomes in Yes, processing will progress to S1 17, and if No becomes, the processing which 
processing returned and mentioned above to S108 will be repeated. 
[0045]ln S1 17, the directions of the page pointers m mentioned above by the Web page 
analyzing parts 121 are advanced only 1. It is distinguished by the Web page analyzing parts 
121 whether in S1 18, the page specified by the present numerical value of the page pointers m 
has exceeded the last page of Web page 20 collected by the Web page collection 
Management Department 110, If the result of this distinction becomes in Yes, this index 
production processing will be completed. On the other hand, if the result of this discrimination 
processing becomes in No, the processing which processing returned and mentioned above to 
S104 will be repeated. 

[0046]Processing to the above is index production processing. Next, the details of the standard 
setting processing Of an index attribute performed by the character emphasis analyzing parts 
124 as processing of S106 in the index production processing mentioned above are explained. 
Drawing 6 is a flow chart which shows the contents of processing of the standard setting 
processing of an index attribute. 

[0047]First, in S121, the "classification" in the HTML filter table 122 is computed for every 
character size [ in / in the sum total of the number of characters of the character string of a line 
which is "STRING" / a "character string attribute" ]. Next, in S122, the rate of the number of 
characters of each character size to the number of characters of all the character strings 
shown, the incidence 122, i.e., the HTML filter table, of each character size, is computed. 
[0048]ln S123, the standard incidence S about the character size set up beforehand is 
acquired. In S124, it is distinguished in S125 to which total addition is carried out and the 
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incidence computed by processing of S122 follows descending of character size whether the 
cumulative value exceeded the reference value S. And when this discriminated result is set to 
Yes, processing progresses to S126. On the other hand, while this discriminated result is No, 
processing of S124 is repeated. 

[0049]ln S126, when the result of the discrimination processing of a front step is set to Yes, the 
larger character size by one than the character size corresponding to the incidence added by 
the processing of S124 in front of that is set up as the reference letter size Esize. In the 
standard setting processing of the index attribute mentioned later, it considers that the 
character string to which the attribute of the character size more than this reference letter size 
Esize is set is a character string which is performing highlighting in Web page 20. 
[0050]ln S127, the "classification" in the HTML filter table 122 is computed for every character 
color [ in / in the sum total of the number of characters of all the character strings shown in the 
line which is "STRING" / a "character string attribute" ]. In S128, the rate of the number of 
characters of each character color to the number of characters of all the character strings 
shown, the incidence 122, i.e., the HTML filter table, of each character color, is computed. 
[0051 ]ln S129, the standard incidence C about the character color set up beforehand is 
acquired. In S130, it is distinguished whether the character color Cn whose incidence is more 
than the standard incidence C exists, and only when this discriminated result is Yes, in S131, 
this character color Cn is set up as the reference color Ecolor. In the standard setting 
processing of the index attribute mentioned later, it is considered that this reference color 
Ecolor is a character string to which the character string to which the attribute of a different 
character color is set is performing highlighting in Web page 20. 

[0052]After finishing processing of S130 and S131, the standard setting processing of this 
index attribute is completed, and processing returns to drawing 4 mentioned above. 
Processing to the above is the standard setting processing of an index attribute. Next, the 
standard setting processing of the index attribute mentioned above is further explained using 
the example of drawing 8 . 

[0053] Drawin g 8 (A) is a table in which showing the incidence of the character contained in the 
becoming Web page, and showing the incidence of each character size from which ** was 
obtained by processing to S122, and a table showing the incidence of each character color 
from which ** was obtained by processing to S128 document 1. 
[0054]Now, suppose that the standard incidence S about the character size acquired by 
processing of S123 was 10%. ** It set, and the sum total of the incidence about the thing more 
than "5" is +5% [ 3% of ] = 8%, and character size is less than the standard incidence S which 
mentioned this value above. On the other hand, it is over the standard incidence S in which the 
sum total of the incidence about the thing more than "4" is +70% [ 3% of +5% of ] = 78%, and 
character size mentioned this value above. Therefore, in the discrimination processing of 
S125, when character size computes the sum total of the incidence about the thing more than 
"4" in processing of S124, the result serves as Yes. And in S126 performed at this time, the 
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larger character size by one than the character size corresponding to the incidence added by 
the processing of S124 in front of that, i.e., "5", is set up as the reference letter size Esize. 
Therefore, in the standard setting processing of the index attribute mentioned later, it considers 
that the character string to which the attribute of the character size more than "5" is set, i.e., 
the character string by which character size is set as "5" or "6", is a character string which is 
performing highlighting in Web page 20. 

[0055]Next, considering it as the thing of the maximum of the incidence about the character 
color of the character contained in the document 1 about the standard incidence C about the 
character color acquired by processing of S129 assumes that it was set up beforehand. ** Set, 
the character color of the maximum [ incidence ] is "black", and the incidence is 90%. 
Therefore, by processing of S129, since this incidence is set up as the standard incidence C, 
by processing of continuing S130 and S131, the black chisel whose incidence is not less than 
90% is set up as the reference color Ecolor at the character color whose incidence is more 
than the standard incidence C, i.e., here. Therefore, in the standard setting processing of the 
index attribute mentioned later, it is considered that black is a character string to which the 
character string to which the attribute of a different character color is set, i.e., the character 
string whose character color is "red" or "blue" here, is performing highlighting in Web page 20. 
[0056] Drawing 8 (B) is a table in which showing the incidence of the character contained in the 
becoming Web page, and showing the incidence of each character size from which ** was 
obtained by processing to S122, and a table showing the incidence of each character color 
from which ** was obtained by processing to S128 document 2. 
[0057]Now, suppose that the standard incidence S about the character size acquired by 
processing of S123 was 10% like the Web page of the document 1 . ** It set, and the sum total 
of the incidence about the thing more than "4" is 1%, and character size is less than the 
standard incidence S which mentioned this value above. On the other hand, it is over the 
standard incidence S in which the sum total of the incidence about the thing more than "3" is 
+99% [ per % ] = 100%, and character size mentioned this value above. Therefore, in the 
discrimination processing of S125, when character size computes the sum total of the 
incidence about the thing more than "3" in processing of S124, the result serves as Yes. And in 
S126 performed at this time, the larger character size by one than the character size 
corresponding to the incidence added by the processing of S124 in front of that, i.e., "4", is set 
up as the reference letter size Esize. Therefore, in the standard setting processing of the index 
attribute mentioned later, it considers that the character string to which the attribute of the 
character size more than "4" is set, i.e., the character string by which character size is set as 
"5", is a character string which is performing highlighting in Web page 20. 
[0058]Next, suppose that the standard incidence C about the character color acquired by 
processing of S129 was 10%. ** It sets and the character color whose incidence is more than 
the standard incidence C, i.e., the red whose incidence is not less than 10% here, and blue are 
set up as the reference color Ecolor in processing of S130 and S131. Therefore, in the 
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standard setting processing of the index attribute mentioned later, it is considered that it is the 
character string to which the attribute of a different character color from red or blue both is set, 
i.e., the character string to which the character string whose character color is "black" here is 
performing highlighting in Web page 20. 

[0059]Next, the details of the setting processing of an index attribute performed by the 
character string analyzing parts 125 as processing of S109 in the index production processing 
mentioned above are explained. First, in S141, the character string attribute shown in the line 
specified by the current value of the filter pointer n in the HTML filter table 122 mentioned 
above is acquired. 

[0060]ln S142, the character size in the character string attribute acquired by processing of the 
front step, In S143 it is distinguished whether it is more than the reference letter size Esize set 
up by the standard setting processing of the index attribute mentioned above, and only when 
the result of this distinction is Yes, The character size flag with which the attribute given to the 
character string shown in the line specified by the current value of the filter pointer n mentioned 
above is defined as the index attribute buffer held temporarily is set to "1 ." 
[0061]ln S144, the character color in the character string attribute acquired by processing of 
S141 , In S145 only when the result of this distinction is Yes, namely, only when it is 
distinguished whether it differs from the reference letter color Ecolor set up by the standard 
setting processing of the index attribute mentioned above, and character colors differ, The 
character color flag with which the attribute given to the character string shown in the line 
specified by the current value of the filter pointer n mentioned above is defined as the index 
attribute buffer held temporarily is set to "1." 

[0062]ln S147 it is distinguished whether the flag "1" is stored in the column of the "bold letter" 
in the character string attribute acquired by processing of S141 in S146, and only when the 
result of this distinction is Yes, The bold letter flag with which the attribute given to the 
character string shown in the line specified by the current value of the filter pointer n mentioned 
above is defined as the index attribute buffer held temporarily is set to "1 ." 
[0063]After finishing processing of S146 and S147, the setting processing of this index 
attribute is completed and processing returns to drawing 4 mentioned above. Processing to the 
above is the setting processing of an index attribute. The data structure of the index file 310 
generated by performing word index production processing of drawing 4 explained by the 
above to Web page 20 illustrated to drawin g 3 is shown in drawing 9 . In the figure, although 
the logical position on the Internet 4 to which Web page 20 shown in drawin g 3 is opened is 
shown as "the document 1", of course, it does not matter even if URL (Uniform Resource 
Locator) shows this position information, for example. 

[0064]lf the line of the entry "easy" in drawing 9 is made into an example and the index file 310 
is explained, The "easy" word is included in Web page 20 currently exhibited in the position 
"the document 1" [ on the Internet 4 ] Becoming, and it is shown in Web page 20 that all of 
highlighting according [ this word ] to character size, highlighting by a character color, and 
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highlighting by a bold letter are made, highlighting mentioned above about the "easy" word 
when referring to drawing 3 -- ******** _ things are shown. 

[0065]Next, an example is shown and explained about the details of processing of the 
information retrieval performed at the information retrieval Management Department 200 which 
the information retrieval site 1 has. Dra wing 10 is a flow chart which shows the contents of 
processing of the retrieval processing performed by the information management retrieval part 
200. First, the contents of processing of retrieval processing are explained along with the 
figures. 

[0066]ln S201, only when it is distinguished by the information retrieval section 210 whether 
the search formula in which the word which are the demand of information retrieval sent from 
the browser 30 and an object of that search is shown was received and this discriminated 
result is set to Yes, processing progresses to S202. In the information retrieval section 210, if a 
search formula is sent, the search formula is stored in the search formula storage 211. 
[0067]ln S202, the sent search formula is analyzed by the information retrieval section 210, 
and the word which is a retrieval object is started from the search formula. Search of the entry 
of the index file 310 which used the search word as the key is performed by the information 
retrieval section 210 in the turn that the search word was started S203. 
[0068]ln S204, as a result of search at a front step, it is distinguished by the information 
retrieval section 21 0 whether the entry which is in agreement with a search word was 
discovered, if the result of this distinction becomes in Yes, processing will progress to S205, 
and if No becomes, processing will progress to S207. In S206 which all of position information, 
an abstract, and an attribute flag are acquired from the retrieval record in which the entry which 
is in agreement with a search word was contained by the information retrieval section 210, and 
continues by it in S205, The record which consists of an entry which is in agreement with this 
search word, and position information, an abstract and an attribute flag is stored in the search- 
results file 320 by the information retrieval section 21 0. 

[0069]lt is distinguished [ which was mentioned above about all the search words started in 

5207 by the processing of S202 mentioned above ] by the information retrieval section 210 
whether processing of search of S203 was performed, If the result of this distinction becomes 
in Yes, processing will progress to S208, if the result of this distinction becomes on the other 
hand in No, processing will return to S203 and processing mentioned above about the search 
word in which processing of search is not yet performed will be performed. 

[0070]The position information applicable to all the search words started by the processing of 
S202 mentioned above among the position information stored in the search-results file 320 in 

5208 here, That is, the position information included common to all the records of the search- 
results file 320 is extracted from the search-results file 320 by the search-results Management 
Department 220 with an abstract. 

[0071 ]ln S209. [ whether processing at a front step was able to extract position information, 
and ] That is, it is distinguished by the search-results Management Department 220 whether 
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the position information included common to ail the records of the search-results file 320 
existed, if the result of this distinction becomes in Yes, processing will progress to S210, and if 
No becomes, processing will progress to S213. 

[0072]ln S210, the position information extracted by the processing of S208 mentioned above 
and the abstract which is matched and is stored in the position information in the search- 
results file 320 are stored in the search-results list file 330 by the search-results Management 
Department 220. 

[0073]The number of the attribute flag which is matched and is stored in the position 
information extracted by the processing of S208 mentioned above in the search-results file 320 
in S21 1 is calculated by the search-results Management Department 220 for every position 
information of the, This counting result is stored in the search-results list file 330 as an attribute 
point size. 

[0074]ln S212, the search-results list file 330 is sorted by the search-results Management 
Department 220 so that it may become descending of the enumerated data of the attribute 
point calculated by the front step. In S214 which the HTML file which expresses the contents of 
the search-results list after sorting by a Web page in S213 based on the search-results list file 
330 is created by the HTML preparing part 410, and follows, The created HTML file addresses 
to the browser 30 which is the transmitting origin of the search formula mentioned above by 
the WWW server part 400, and is sent out to it, and this retrieval processing is completed. 
[0075]Processing to the above is retrieval processing. Next, the case where what was shown 
in drawing 9 as the index file 310 is stored in the information database Management 
Department 300 is made into an example, and this retrieval processing is explained. First, if a 
search formula is sent from the browser 30, the result of distinction of S201 will serve as Yes, 
and logging of a search word will be performed in S202 continuing. Here, each word of the 
"hamburger" and the "tomato" should be started as a search word as a result of processing of 
S202. 

[0076]lf logging of a search word is completed, processing will progress to S203, first, search 
of a search word "hamburger" is performed about the entry of the index file 310, and the record 
about the entry "hamburger" in drawing 9 is discovered. Therefore, the result of the 
discrimination processing of S204 serves as Yes, and processing follows it to S205. 
[0077]ln S205, all of position information, an abstract, and an attribute flag are acquired from 
the discovered record, and the record which consists of the position information, an abstract, 
and an attribute flag in S206 continuing is stored in the search-results file 320. Then, although 
discrimination processing in S207 is performed, since processing of search of S203 is not yet 
performed about the inside "tomato" of the search word started by the processing of S202 
mentioned above, the result of the discrimination processing of S207 serves as No, and 
processing returns to S203. 

[0078]Henceforth, the same processing as the search word "hamburger" mentioned above 
about the search word "tomato" is performed, The record which the record about an entry 
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"tomato" is discovered from the index file 310 shown in drawing 9, and consists of the position 
information, character string, and the link flag and the search word "tomato" in the record is 
stored in the search-results file 320. 

[0079]The contents of the search-results file 320 generated by processing to the above are 
shown in drawing 1 1 . After the search-results file 320 shown in this drawing 1 1 is generated, 
the result of the discrimination processing of S207 serves as No, and processing progresses to 
S208. 

[0080] Extraction of the position information included common to all the records of the search- 
results file 320 in S208 is performed, as a result -- three, "the document 1", the "document 2", 
and the "document 3", are extracted here as position information included common to a 
"hamburger" and both the records of a "tomato" - ** - it carries out. Therefore, the result of 
the discrimination processing of Scontinuing 209 serves as Yes, and processing progresses to 
S210. 

[0081]Three position information extracted in S210, "the document 1", the "document 2", and 
the "document 3", In S21 1 which the character string which is matched and is stored in the 
position information in the search-results file 320 is stored in the search-results list file 330, 
and continues, The number of the attribute flag which is matched and is stored in each of the 
extracted position information "document 1", the "document 2", and the "document 3" is 
calculated, respectively, and the counting result is stored in the search-results list file 330 as 
an attribute point size. 

[0082] Drawing 12 is explained here. The figure shows the contents of the search-results list file 
330, and what is shown in the figure (a) is created as the search-results list file 330 by the 
processing to S211 mentioned above. In the search-results file 320 shown in drawing 11 , since 
a total of six attribute flags about "the document 1" are stored, the attribute point size about the 
position information "document 1" in the search-results list file 330 shown in drawing 12 (a) is 
set to "6." 

[0083]From the search-results file 320 same also about the attribute point of the "document 2" 
and the "document 3" and shown in drawin g 11 . The attribute point size about the "document 
2" in the search-results list file 330 shown in drawing 12 (a) is set to "0", and the attribute point 
size about "the document 3" is set to "3." 

[0084]ln S212 continuing, creation of the search-results list file 330 which shows drawing 12 
(a) the contents by processing to S211 mentioned above will perform sorting of the search- 
results list file 330 so that it may become descending of an attribute point size value. The result 
to which sorting based on an attribute point size was carried out to the search-results list file 
330 of drawing 12 (a) is shown in drawing 12 (b), and the turn of each line is rearranged in 
order of the "document 1" with a high attribute point, the "document 3", and the "document 2." 
[0085]Then, in S213, the HTML file which the HTML file which expresses the contents of the 
search-results list file 330 to which sorting was performed like drawing 12 (b) by a Web page 
was created, and was created in S214 continuing is sent out, and this retrieval processing is 

http://www4.ipdlinpit.go.jp/cgi-bin/tran_web_cgi_ejj 3/9/201 0 



JP,2003-030228,A [DETAILED DESCRIPTION] 



Page 16 of 17 



completed. 

[0086]The example of a screen of the Web page which shows the result of information retrieval 
displayed when the created HTML file is perused by the browser 30 is shown in drawing 13 . . 
Are preferentially arranged from what it is the search results about the word of a "hamburger" 
and a "tomato", and is expected to acquire the information that importance is high in the 
screen shown in the figure. The hyperlink to Web page 20 respectively shown by that position 
information is embedded at the position information on "the document 1", the "document 3", 
and the "document 2", and facilities are given to the user of these search results. 
[0087]ln the analysis of the HTML sauce about Web page 20 in the embodiment described by 
the above, although it has judged whether the character string is emphasized in Web page 20 
based on description of the <FONT> tag and the <B> tag, Based on description of other tags, 
it may be made to perform this judgment. As an example of a tag employable as the judgment 
of this emphasis, <l> on which a character string is displayed with an italic character The <U> 
tag which gives an underline to a tag or a character string, Or at a standard browser, there are 
a <STRONG> tag etc. which can enable it to make the character string able to pronounce 
strongly by the voice browser which reads out with a sound the text indicated to the Web page 
of that on which a character string is only only displayed in a bold letter character. Based on 
the <FONT> tag in which the FACE attribute for specifying the kind of font used for the display 
of a character string is specified, It may be made for the character string as which a different 
font from that by which normal use is carried out to the display sake of the Web page is 
specified to judge with what is emphasized in the Web page. 

[0088]The control program for making it carry out to the computer which has the standard 
composition which mentioned above the index production processing and retrieval processing 
which the information site 1 was performing in the embodiment of explained this invention by 
the above, and the same processing is created, By making the control program read into the 
computer, and performing it, this invention can be carried out by such computer. 
[0089]lt is also possible to carry out this invention by computer by making such a control 
program record on the recording medium which can be read by computer, making the program 
read from a recording medium to a computer, and performing it. The example of the possible 
recording medium of reading the control program made to record by computer is shown in 
drawing 14 . The memory storage 502, such as ROM with which the computer 501 is equipped 
as built-in or external attachment as a recording medium, for example as shown in the figure, 
and a hard disk drive, or a flexible disk, Portable good signifier recording-media 503 grades, 
such as MO (magneto-optical disc), CD-ROM, and DVD-ROM, can be used. A recording 
medium may be the memory storage 506 which is connected with the computer 501 via the 
network 504 and with which the computer which functions as the program server 505 is 
provided. In this case, the transmission signal acquired by modulating a subcarrier with the 
data signal expressing a control program, The control program concerned can be executed 
now by making it transmit through the network 504 which is a transmission medium from the 
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program server 5055, restoring to the transmission signal received by computer 501 , and 

reproducing a control program. 

[0090] 

[Effect of the lnvention]According to this invention, in the word which constitutes the character 
string contained in the document information currently released on the communication 
network. By matching the position information which shows the position of the document 
information, and the emphasis attributes which show the emphasis given to the character 
string, and registering with an index file. When the index file is searched based on the word 
showing a retrieval object, Since that with which emphasis attributes are matched can be given 
priority to and shown to the word matched with the position information among the position 
information acquired by the search, an information retrieval person can be provided with a 
more suitable information retrieval result to a retrieval object. 



[Translation done.] 
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1 This document has been translated by computer. So the translation may not reflect the 
original precisely. 

2.**** shows the word which can not be translated. 
3.ln the drawings, any words are not translated. 



[Field of the lnvention]This invention relates to the art of enabling it to provide the information 
which agreed appropriately by the demand, to the demand of search especially about the art of 
retrieving information. 



[Translation done.] 
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LThis document has been translated by computer. So the translation may not reflect the 
original precisely. 

2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



[Description of the Prior Art] In recent years, the number of the Web pages provided by the 
WWW (WorldWideWeb) system on the Internet is continuing increasing explosively by the 
spread of the Internet. On the Internet, many search engines which provide the service which 
retrieves the information made into the purpose out of this huge information are established. 
[0003]There are some which are called the robot type as one of the methods which collects the 
information on a network of a search engine. In a robot type search engine, the robot program 
called a spider or a crawler is started periodically, Automatic collection of the HTML 
(HyperText Markup Language) file expressing the Web page currently exhibited on the Internet 
is performed. When information retrieval is performed and the information retrieval person 
using a search engine gives a closely related keyword to the target information at a search 
engine site, Processing which extracts that in which the keyword was contained from the 
collected file is performed, and an information retrieval person is provided with the list of Web 
pages which are the keyword and which are contained as search results with the information 
which shows the logical position on the Internet about the Web page. 



[Translation done ] 
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LThis document has been translated by computer. So the translation may not reflect the 
original precisely. 

2.**** shows the word which can not be translated. 
3.ln the drawings, any words are not translated. 



[Effect of the lnvention]ln the word which constitutes the character string contained in the 
document information currently released on the communication network from this invention. By 
matching the position information which shows the position of the document information, and 
the emphasis attributes which show the emphasis given to the character string, and registering 
with an index file. When the index file is searched based on the word showing a retrieval 
object, that by which emphasis attributes are matched with the word matched with the position 
information among the position information acquired by the search can be given priority to and 
shown. 

Therefore, an information retrieval person can be provided with a more suitable information 
retrieval result to a retrieval object. 



[Translation done.] 
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LThis document has been translated by computer. So the translation may not reflect the 
original precisely. 

2.**** shows the word which can not be translated. 
3.ln the drawings, any words are not translated. 



[Problem(s) to be Solved by the lnvention]Generally, since the robot type search engine is 
performing [ no ] automatically processings of a to [ from collection of information / offer of 
search results ] by computer and operation of the information by judgment of human being 
intervenes there, The arrangement about the quality of the genre to which the collected 
information belongs, or its information is not made. Therefore, if search by coincidence of a 
mere keyword was performed on the occasion of search of information, a Web page including 
important information is buried in search results, or. Or there were not few cases where it will 
be mostly contained in search results only in about the Web page in which what is called a 
search noise, i.e., the low information on usefulness, is contained. 
[0005]Making more suitable the result of the information retrieval which a search engine 
provides in view of the above problem to an information retrieval person's retrieval object is the 
issue which this invention tends to solve. 



[Translation done.] 
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1 This document has been translated by computer. So the translation may not reflect the 
original precisely. 

2.**** shows the word which can not be translated. 
3.ln the drawings, any words are not translated. 



[Means for Solving the Problem]A word contained in document information to which this 
invention is opened on a communication network, An index file which matches position 
information which shows a document information position in which it is the information which 
shows a logical position on this communication network, and information containing this word 
exists is prepared, It is premised on a system or a method of showing position information 
corresponding to a word which searches this index file based on a word showing an object of 
search, and expresses this retrieval object. 

[0007]And an information retrieval system which is one of the modes of this invention, An 
emphasis-attributes acquisition means which acquires emphasis attributes which are attributes 
given to a character string contained in said document information, and show emphasis, In a 
word extracted by extraction means to extract a word from said character string, and said 
extraction means. A registration means to match said position information about this word, and 
said emphasis attributes given to a character string of extraction origin of this word, and to 
register with said index file, A search means to acquire position information which searches 
said index file based on a word showing said retrieval object, and is matched with this word 
from this index file, A presenting means which gives priority to that by which said emphasis 
attributes are matched with a word which this search means made an object of search of this 
position information in said index file among position information acquired by said search 
means, and presents this position information, SUBJECT mentioned above is solved by 
constituting so that it may ****. 

[0008]An attribute given to a character string contained in said document information here is 
an attribute which shows color of a character used in order to display an attribute which shows 
a size of a character used when displaying this character string, for example, or this character 
string. It is possible that document information containing a word emphasis is instructed to be 
has a high possibility that information that importance is high is included about the word. 
Therefore, when two or more document information corresponding to a word which shows a 
search condition is opened to a communication network according to composition mentioned 
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above, Since the word is emphasized, and priority is given to position information about 
document information considered that importance is high, it makes and it is shown, a result of 
information retrieval will become more suitable to an information retrieval person's retrieval 
object. 

[0009]ln an information retrieval system concerning this invention mentioned above, said 
emphasis-attributes acquisition means, Afrequency-of-occurrence calculating means which 
computes the frequency of occurrence in this document information about an attribute given to 
a character string contained in said document information for this every attribute, An emphasis- 
attributes setting-out means to set up a standard which distinguishes whether said attributes 
are said emphasis attributes based on the frequency of occurrence for this every attribute, It 
may constitute so that it may have an emphasis-attributes discriminating means which 
distinguishes whether attributes given to a character string contained in said document 
information are said emphasis attributes based on said standard. 
[0010]Since it can judge now that a character string to which a unique attribute is given in 
document information is emphasized in that document information according to this 
composition, this document information can be registered into an index file for a word 
contained in this character string as what has high importance. 

[001 1]ln an information retrieval system concerning this invention mentioned above, said 
emphasis-attributes acquisition means, When making into a bold letter a character used in 
order that an attribute given to a character string contained in said document information may 
display this character string is shown, it may be made to consider that these attributes are said 
emphasis attributes. 

[0012]According to this composition, in order to show emphasis of a character string in 
document information, the attribute of a purport that a display by a bold letter currently 
performed widely is performed can be promptly judged with emphasis attributes. In an 
information retrieval system concerning this invention mentioned above, said presenting 
means, Priority is given to a thing which has many number of said emphasis attributes 
matched with a word which this search means made an object of search of this position 
information in said search file among position information acquired by said search means, and 
it may be made to show this position information. 

[0013]According to this composition, for a word by which a thing which has many number of 
emphasis attributes given to a certain character string is contained in this character string, this 
document information can be registered now into an index file as what has higher importance, 
and a result of information retrieval will become still more suitable to an information retrieval 
person's retrieval object. 

[0014]An information retrieval method which is one of the another modes of this invention, 
Emphasis attributes which are attributes given to a character string contained in said document 
information, and show emphasis are acquired, Match said position information about this word, 
and said emphasis attributes given to a character string of extraction origin of this word with a 
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word which extracted a word from said character string and was extracted from said character 
string, and it registers with said index file, Position information which searches said index file 
based on a word showing said retrieval object, and is matched with this word is acquired from 
this index file, The same operation and effect as an information retrieval system concerning 
this invention mentioned above are acquired by giving priority to that by which said emphasis 
attributes are matched with a word made into an object of search of this position information in 
said search file among position information acquired by said search, and showing this position 
information. 

[0015]SUBJECT mentioned above by making a computer execute the program also in a 
program for making processing which consists of the same procedure as an information 
retrieval method concerning this invention mentioned above perform to a computer is solvable. 

[0016] 

[Embodiment of the lnvention]Hereafter, an embodiment of the invention is described based on 
a drawing. Drawing 1 is a figure in which the information retrieval site which carries out this 
invention shows the entire configuration of the communication network which provides an 
information search service. 

[0017]ln drawing 1 , the Internet 4 which is all a communication network is accessed, and data 
can be delivered [ the information retrieval site 1 the offer-of-information site 2a, 2b, 2c, 2d, 
and the user terminals 3a and 3b ] and received mutually. The information retrieval site 1 is a 
WWW server system which provides a robot search type information search service for the 
user terminals 3a and 3b, is provided with the Research and Data Processing Department 100, 
the information retrieval Management Department 200, the information database Management 
Department 300, and the WWW server Management Department 400, and is constituted. 
[0018]The Research and Data Processing Department 100 performs automatic collection of 
the information currently released on the Internet 4, and accumulates the collected information 
in the information database Management Department 300. The information retrieval 
Management Department 200 retrieves the information accumulated in the information 
database Management Department 300 according to the demand of the information retrieval 
sent via the Internet 4, and returns the result of the search to a requiring agency. 
[0019]At the information database Management Department 300, accumulation of the 
information collected by the Research and Data Processing Department 100 and search of the 
information by the information retrieval Management Department 200 are performed. The 
processing by which the WWW server part 400 transmits the collected information which is 
sent via the Internet 4 to the Research and Data Processing Department 100, Processing 
which transmits the demand of the information retrieval sent via the Internet 4 to the 
information retrieval Management Department 200, and processing of sending out of a Web 
page in which the information which shows the result of the information retrieval sent from the 
information retrieval Management Department 200 is expressed are performed. 
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[0020]The offer-of-information site 2a, 2b, and 2c and 2d are WWW server systems which 
exhibit Web pages 20a, 20b, 20c, and 20d on the Internet 4, respectively. Although four offer- 
of-information sites are shown in drawing 1, the number of the offer-of-information sites 
connected to the Internet 4 may be arbitrary. 

[0021 ]The user terminals 3a and 3b, respectively The offer-of-information site 2a, 2b, 2c, And it 
is a computer which can perform the browsers 30a and 30b which are the software which 
peruses the Web page provided from 2 d or the information retrieval site 1 , and is operated by 
the information retrieval person who requests search of the information currently released on 
the Internet 4 to the information retrieval site 1. Although two users are shown in drawing 1 , the 
number of the user terminals connected to the Internet 4 may also be arbitrary. 
[0022]These information retrieval sites 1, the offer-of-information site 2a, 2b, 2c and 2d, and 
the user terminals 3a and 3b, The computer by which all have standard composition, i.e., CPU 
which controls each component by executing a control program, The storage parts store used 
as the work area at the time of the memory and CPU of a control program which consist of a 
ROM, RAM, a magnetic storage device, etc., and make CPU control each component 
executing a control program, or a storage area of various data, It can also constitute using a 
computer provided with the input part from which various kinds of data corresponding to 
operation by a user is acquired, the outputting part which show a display etc. various kinds of 
data and of which a user is notified, and the l/F part which provides the interface function for 
connecting with a network. 

[0023]Next, dr awing 2 is explained. The figure shows the detailed composition of the 
information retrieval site 1 in drawing 1 which carries out this invention. As shown in drawing 2 , 
the Research and Data Processing Department 100 has the Web page collection Management 
Department 110 and the index production part 120, and is constituted, The information 
management retrieval part 200 is provided with the information retrieval section 210 and the 
search-results Management Department 220, and is constituted, The database manager 300 
has the index file 310, the search-results file 320, and the search-results list file 330, and is 
constituted, and WWW server 400 is provided with the HTML preparing part 410, and is 
constituted. 

[0024]The Web page collection Management Department 1 10 performs periodically automatic 
collection of Web page 20 currently exhibited on the Internet 4. The position information on 
Web page 20 by which the index build Management Department 120 was collected by the 
Web page collection Management Department 110, That is, the record used as the index 
which can lengthen the position information which shows the logical position on the Internet 4 
in which Web page 20 exists is created, and it registers with the index file 310. The index build 
Management Department 120 has the Web page analyzing parts 121, the word extraction 
Management Department 123, and the index registering part 126, and is constituted. 
[0025]The Web page analyzing parts 121 create the HTML filter table 122 which makes the 
unit of a record each HTML tag described by the text of the HTML file which analyzes Web 
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page 20 and is expressing Web page 20. processing which analyzes the conditions of the 
character style which is emphasized when they are displayed as a screen of Web page 20 in 
the character string shown in the HTML filter table 122, and which can be been [ character 
style / it ] rich and made being performed by the character emphasis analyzing parts 124, and 
at the word extraction Management Department 123, Analysis of the character string shown in 
the HTML filter table 122 is conducted by the character string analyzing parts 125, and a word 
is extracted from the character string. 

[0026]The logical position information [ the index registering part 126 makes an entry the word 
extracted by the character string analyzing parts 125, and / entry / the ] on the Internet 4 about 
Web page 20, The index record in which the abstract of Web page 20 in which the word was 
contained, and the form set as the word by Web page 20 matched the attribute flag which 
shows that it agrees on the conditions of the character style obtained in the analysis in the 
character emphasis analyzing parts 124 is registered into the index file 310. 
[0027]The information retrieval section 210 acquires the demand of the information retrieval 
sent from the user terminal by control of the browser 30 currently performed with one of the 
user terminals connected to the Internet 4 from the WWW server part 400, The search formula 
showing the conditions of the information retrieval is taken out from the demand, and it stores 
in the search formula storage 21 1 . And the word (keyword) which searches the index file 310 
and is shown in the search formula acquires the index record used as a title, and stores in the 
search-results file 320. 

[0028]lf search by the information retrieval section 210 is completed, the search-results 
Management Department 220, The position information and the abstract which are shown in 
the index data stored in the search-results file 320, and the total number of the attribute flag 
attached with the index record corresponding to the position information are stored in the 
search-results list file 330. And the position information stored in the search-results list file 330 
is sorted according to the total number. 

[0029]The HTML preparing part 410 creates the HTML file expressing the Web page which 
receives the search-results list which consists of sorted position information which is stored in 
the search-results list file 330 and as which the search-results list is expressed. The created 
HTML file is addressed to the user terminal in which the browser 30 is performed, and is sent 
out to the Internet 4 by the WWW server part 400. 

[0030]Next, the details of processing of the collection of a Web page performed in the 
Research and Data Processing Department 100 which the information retrieval site 1 has, and 
generation of an index are explained. Drawin g 3 shows the example of Web page 20 which is 
opened to the Internet 4 and collected by the information retrieval site 1. In the figure, the 
browser's 30 inspection of the HTML sauce shown in (b) will display the screen shown in the 
figure (a). 

[0031] Drawing 4 is explained here. The figure is a flow chart which shows the contents of 
processing of the index production processing performed in the Research and Data 
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Processing Department 100. By performing this processing, collection of a Web page and 
generation of an index are performed in the Research and Data Processing Department 100. 
First, in S101 , only when it was distinguished at the Web page collection Management 
Department 1 1 0 whether the present date is the collection designated date of Web page 20 
specified beforehand, this decision result is set to Yes and the present becomes that 
designated date, processing progresses to S102. Although the method of specification of this 
date is arbitrary, specification called the monthly final day of month end, etc. is performed, for 
example. 

[0032]ln S102, processing of a round and collection of Web page 20 currently exhibited on the 
Internet 4 is performed by the Web page collection Management Department 110. The 
technique of this round and collection should just use as it is what is performed from the former 
by the well-known robot type search engine. 

[0033]ln the Web page analyzing parts 121, the initial value 1 is substituted for the variable m 
used as page pointers which are pointers for pointing at a time to 1 page of Web pages 20 of a 
large number collected at the front step S103. The structure of the page to which it points by 
the current value of the page pointers m in Web page 20 collected by processing of S102 in 
S104 is analyzed by the Web page analyzing parts 121 , The HTML filter table 122 is 
generated by the Web page analyzing parts 121 in S105 continuing. 
[0034]The HTML filter table generated from the Web page shown in drawing 3 is shown in 
drawing 5 . Analysis of the HTML sauce shown in drawing 3 (b) by the Web page analyzing 
parts 121 will generate the HTML filter table shown in drawing 5 . When it explains further, 
referring to drawing 3 (b) for the contents of processing of S104, in the Web page analyzing 
parts 121. The text of the HTML sauce of an analytical object, i.e., the portion pinched between 
the start tag of <BODY> and the end tag, is made into the object of analysis, and the structure 
of each sentence where the <BR> tag (line feed tag) in the portion is considered as a pause of 
a sentence, and is contained in the text is analyzed. 

[0035]lf signs that the HTML filter tabje shown in drawing 5 from the HTML sauce shown in 
drawing 3 (b) is created are explained, First, let the portion pinched between the start tag of 
the <BODY> tag and end tag which are the description parts of the text in HTML sauce, i.e., 
the portion put between the <BODY> tag and the </BODY> tag, be an object of the analysis by 
processing of S 104. 

[0036]here -- first - an analytical object - a portion -- it can set - the beginning - < - BR -- > - 
- a tag - describing - having **** - a part up to - a portion - namely, - < »- FONT 
SIZE - = - " - six - " - COLOR -- = " # - FF 0000 - " - > - < - B « > easy 
cooking - < - /- B - > - < /- FONT > - " - a portion is analyzed. <FONT 
SIZE="6"COLOR="#FF0000"> Here the becoming tag, the "easy dish" described by the 
portion pinched between the start tag of <FONT>, and the end tag a character string - 
character size - "6" - considering it as a size - and a character color - H #FF0000 n ~ what is 
displayed as a color shown numerically is shown. The color shown for this figure is red. 
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[0037]Displaying the character string "the easy dish" <B> [ which is described by the portion by 
which the becoming tag is sandwiched between the start tag of <FONT> and the end tag ] 
Becoming in a bold letter is shown. The analysis processing by S104 conducts analysis 
mentioned above, and the record shown in the 1st line of drawing 5 which means the contents 
of this analysis result is stored in the HTML page filter 122 by processing of S105 performed 
next. An "easy dish" is stored in the column of a "character string" when this record is 
explained, "STRING" which shows that an "easy dish" is a character string is stored in the 
column of "classification", and the attribute which shows that "6" and a "color" are displayed for 
"character size" as "red" about the character string of an "easy dish" is stored in the column of 
a "character string attribute." By furthermore storing the flag "1" in the column of the "bold 
letter" in a "character string attribute" shows that displaying the character string of an "easy 
dish" in a bold letter was shown. And the record in which existence of the <BR> tag which only 
makes it the contents for "classification" to be "BR" after the is shown is stored in the 2nd line 
of the HTML page filter 122. 

[0038]Analysis of the remaining portion in the text part of the HTML sauce shown in drawing 3 
(b) is conducted similarly hereafter, and the HTML page filter 122 shown in drawing 5 in this 
way is generated. Although the SIZE attribute in the <FONT> tag may be shown like "+1", "-1", 
etc. by the relative value over the usual displayed character size, When such, the relative 
value is registered as character size, and about the character string by which specification of 
character size is not made, "0" is registered as character size. 

[0039]lt returns to explanation of drawing 4 and processing which analyzes the standard of the 
character string attribute performing setting out of the standard of an index attribute, i.e., 
highlighting, can consider that it is set up, and sets up the standard is performed by the 
character emphasis analyzing parts 124 in the character string stored in the HTML page filter 
122 in S106. The details of this processing are mentioned later. 

[0040]The initial value 1 is substituted for the variable n used as a filter pointer which is a 
pointer for specifying one [ at a time ] each line (record) of the HTML filter table 122 in order by 
the character string analyzing parts 125 S107. It is distinguished by the character string 
analyzing parts 125 whether the data in which the classification of the character string shown 
in the line specified by the filter pointer n mentioned above in S108 is shown is "STRING", if 
the result of this distinction becomes in Yes, processing will progress to S109, and if No 
becomes, processing will progress to S1 15. 

[0041]ln S109, processing which sets an index attribute as the character string shown in the 
line specified by the filter pointer n based on the standard set up by processing of S106 is 
performed by the character string analyzing parts 125. The details of this processing are also 
mentioned later. Then, in [ processing which starts a word from the character string shown in 
the line specified by the filter pointer n in S1 10 is performed by the character string analyzing 
parts 125, and ] S1 1 1 , Processing from which the part of speech extracts the word which is a 
noun is continuously performed by the character string analyzing parts 125 from the started 
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word. 

[0042]A well-known method is adopted as processing of logging of the word in S1 10. The 
method made into the word which used what is called a morphological analysis, for example, 
acquired the canonical form of that word for the part of speech and conjugated form of the 
word which were started as a method of this common knowledge using various kinds of 
dictionaries, and started the word of that canonical form from the character string, There are 
what is called an N gram method etc. that start the word of length N mechanically in order 
shifting logging of a character string of one character at a time from the head of the character 
string. 

[0043]ln S1 12, it is distinguished by the character string analyzing parts 125 whether the word 
extracted by the processing of S1 1 1 mentioned above existed, if this decision result becomes 
in Yes, processing will progress to S1 13, and if No becomes, processing will progress to S1 15. 
The position information which is the page which made the title the word extracted by the 
processing of S1 1 1 mentioned above in S1 13, and in which the word was contained, The 
index which matched with the word of the title the abstract of the text indicated to the page and 
the character attribute set up by processing of S109 to the character string in which the word 
was contained is generated by the index registering part 126, The index generated by 
processing of Scontinuing 114 is registered into the index file 310. 
[0044]ln S1 15, directions of the filter pointer n mentioned above by the character string 
analyzing parts 125 are advanced only 1. It is distinguished by the character string analyzing 
parts 125 whether in S1 16, the line specified by the present numerical value of the filter pointer 
n has exceeded the last line that exists in the HTML filter table 122, If this discriminated result 
becomes in Yes, processing will progress to S1 17, and if No becomes, the processing which 
processing returned and mentioned above to S108 will be repeated. 
[0045]ln S117, the directions of the page pointers m mentioned above by the Web page 
analyzing parts 121 are advanced only 1. It is distinguished by the Web page analyzing parts 
121 whether in S1 18, the page specified by the present numerical value of the page pointers m 
has exceeded the last page of Web page 20 collected by the Web page collection 
Management Department 110, If the result of this distinction becomes in Yes, this index 
production processing will be completed. On the other hand, if the result of this discrimination 
processing becomes in No, the processing which processing returned and mentioned above to 
S104 will be repeated. 

[0046] Processing to the above is index production processing. Next, the details of the standard 
setting processing of an index attribute performed by the character emphasis analyzing parts 
124 as processing of S106 in the index production processing mentioned above are explained. 
Drawing 6 is a flow chart which shows the contents of processing of the standard setting 
processing of an index attribute. 

[0047]First, in S121, the "classification" in the HTML filter table 122 is computed for every 
character size [ in / in the sum total of the number of characters of the character string of a line 
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which is "STRING" / a "character string attribute" ]. Next, in S122, the rate of the number of 
characters of each character size to the number of characters of all the character strings 
shown, the incidence 122, i.e., the HTML filter table, of each character size, is computed. 
[0048]ln S123, the standard incidence S about the character size set up beforehand is 
acquired. In S124, it is distinguished in S125 to which total addition is carried out and the 
incidence computed by processing of S122 follows descending of character size whether the 
cumulative value exceeded the reference value S. And when this discriminated result is set to 
Yes, processing progresses to S126. On the other hand, while this discriminated result is No, 
processing of S124 is repeated. 

[0049]ln S126, when the result of the discrimination processing of a front step is set to Yes, the 
larger character size by one than the character size corresponding to the incidence added by 
the processing of S124 in front of that is set up as the reference letter size Esize. In the 
standard setting processing of the index attribute mentioned later, it considers that the 
character string to which the attribute of the character size more than this reference letter size 
Esize is set is a character string which is performing highlighting in Web page 20. 
[0050]ln S127, the "classification" in the HTML filter table 122 is computed for every character 
color [ in / in the sum total of the number of characters of all the character strings shown in the 
line which is "STRING" / a "character string attribute" ]. In S128, the rate of the number of 
characters of each character color to the number of characters of all the character strings 
shown, the incidence 122, i.e., the HTML filter table, of each character color, is computed. 
[0051 ]ln S129, the standard incidence C about the character color set up beforehand is 
acquired. In S130, it is distinguished whether the character color Cn whose incidence is more 
than the standard incidence C exists, and only when this discriminated result is Yes, in S131, 
this character color Cn is set up as the reference color Ecolor. In the standard setting 
processing of the index attribute mentioned later, it is considered that this reference color 
Ecolor is a character string to which the character string to which the attribute of a different 
character color is set is performing highlighting in Web page 20. 

[0052]After finishing processing of S130 and S131 , the standard setting processing of this 
index attribute is completed, and processing returns to drawing 4 mentioned above. 
Processing to the above is the standard setting processing of an index attribute. Next, the 
standard setting processing of the index attribute mentioned above is further explained using 
the example of drawing 8 . 

[0053] Drawing 8 (A) is a table in which showing the incidence of the character contained in the 
becoming Web page, and showing the incidence of each character size from which ** was 
obtained by processing to S122, and a table showing the incidence of each character color 
from which ** was obtained by processing to S128 document 1 . 
[0054]Now, suppose that the standard incidence S about the character size acquired by 
processing of S123 was 10%. ** It set, and the sum total of the incidence about the thing more 
than "5" is +5% [ 3% of ] = 8%, and character size is less than the standard incidence S which 
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mentioned this value above. On the other hand, it is over the standard incidence S in which the 
sum total of the incidence about the thing more than "4" is +70% [ 3% of +5% of ] = 78%, and 
character size mentioned this value above. Therefore, in the discrimination processing of 
S125, when character size computes the sum total of the incidence about the thing more than 
"4" in processing of S124, the result serves as Yes. And in S126 performed at this time, the 
larger character size by one than the character size corresponding to the incidence added by 
the processing of S124 in front of that, i.e., "5", is set up as the reference letter size Esize. 
Therefore, in the standard setting processing of the index attribute mentioned later, it considers 
that the character string to which the attribute of the character size more than "5" is set, i.e., 
the character string by which character size is set as "5" or "6", is a character string which is 
performing highlighting in Web page 20. 

[0055]Next, considering it as the thing of the maximum of the incidence about the character 
color of the character contained in the document 1 about the standard incidence C about the 
character color acquired by processing of S129 assumes that it was set up beforehand. ** Set, 
the character color of the maximum [ incidence ] is "black", and the incidence is 90%. 
Therefore, by processing of S129, since this incidence is set up as the standard incidence C, 
by processing of continuing S130 and S131, the black chisel whose incidence is not less than 
90% is set up as the reference color Ecolor at the character color whose incidence is more 
than the standard incidence C, i.e., here. Therefore, in the standard setting processing of the 
index attribute mentioned later, it is considered that black is a character string to which the 
character string to which the attribute of a different character color is set, i.e., the character 
string whose character color is "red" or "blue" here, is performing highlighting in Web page 20. 
[0056] Drawing 8 (B) is a table in which showing the incidence of the character contained in the 
becoming Web page, and showing the incidence of each character size from which ** was 
obtained by processing to S122, and a table showing the incidence of each character color 
from which ** was obtained by processing to S128 document 2. 
[0057]Now, suppose that the standard incidence S about the character size acquired by 
processing of S123 was 10% like the Web page of the document 1 . ** It set, and the sum total 
of the incidence about the thing more than "4" is 1%, and character size is less than the 
standard incidence S which mentioned this value above. On the other hand, it is over the 
standard incidence S in which the sum total of the incidence about the thing more than "3" is 
+99% [ per % ] = 100%, and character size mentioned this value above. Therefore, in the 
discrimination processing of S125, when character size computes the sum total of the 
incidence about the thing more than "3" in processing of S124, the result serves as Yes. And in 
S126 performed at this time, the larger character size by one than the character size 
corresponding to the incidence added by the processing of S124 in front of that, i.e., "4", is set 
up as the reference letter size Esize. Therefore, in the standard setting processing of the index 
attribute mentioned later, it considers that the character string to which the attribute of the 
character size more than "4" is set, i.e., the character string by which character size is set as 
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"5", is a character string which is performing highlighting in Web page 20. 
[0058]Next, suppose that the standard incidence C about the character color acquired by 
processing of S129 was 10%. ** It sets and the character color whose incidence is more than 
the standard incidence C, i.e., the red whose incidence is not less than 10% here, and blue are 
set up as the reference color Ecolor in processing of S130 and S131. Therefore, in the 
standard setting processing of the index attribute mentioned later, it is considered that it is the 
character string to which the attribute of a different character color from red or blue both is set, 
i.e., the character string to which the character string whose character color is "black" here is 
performing highlighting in Web page 20. 

[0059]Next, the details of the setting processing of an index attribute performed by the 
character string analyzing parts 125 as processing of S109 in the index production processing 
mentioned above are explained. First, in S141, the character string attribute shown in the line 
specified by the current value of the filter pointer n in the HTML filter table 122 mentioned 
above is acquired. 

[0060]ln S142, the character size in the character string attribute acquired by processing of the 
front step, In S143 it is distinguished whether it is more than the reference letter size Esize set 
up by the standard setting processing of the index attribute mentioned above, and only when 
the result of this distinction is Yes, The character size flag with which the attribute given to the 
character string shown in the line specified by the current value of the filter pointer n mentioned 
above is defined as the index attribute buffer held temporarily is set to "1." 
[0061 ]ln S144, the character color in the character string attribute acquired by processing of 
S141 , In S145 only when the result of this distinction is Yes, namely, only when it is 
distinguished whether it differs from the reference letter color Ecolor set up by the standard 
setting processing of the index attribute mentioned above, and character colors differ, The 
character color flag with which the attribute given to the character string shown in the line 
specified by the current value of the filter pointer n mentioned above is defined as the index 
attribute buffer held temporarily is set to "1." 

[0062]ln S147 it is distinguished whether the flag "1" is stored in the column of the "bold letter" 
in the character string attribute acquired by processing of S141 in S146, and only when the 
result of this distinction is Yes, The bold letter flag with which the attribute given to the 
character string shown in the line specified by the current value of the filter pointer n mentioned 
above is defined as the index attribute buffer held temporarily is set to "1." 
[0063]After finishing processing of S146 and S147, the setting processing of this index 
attribute is completed and processing returns to drawing 4 mentioned above. Processing to the 
above is the setting processing of an index attribute. The data structure of the index file 310 
generated by performing word index production processing of drawing 4 explained by the 
above to Web page 20 illustrated to drawing 3 is shown in drawing 9. In the figure, although 
the logical position on the Internet 4 to which Web page 20 shown in drawin g 3 is opened is 
shown as "the document 1", of course, it does not matter even if URL (Uniform Resource 
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Locator) shows this position information, for example. 

[0064]lf the line of the entry "easy" in drawing 9 is made into an example and the index file 310 
is explained, The "easy" word is included in Web page 20 currently exhibited in the position 
"the document 1" [ on the Internet 4 ] Becoming, and it is shown in Web page 20 that all of 
highlighting according [ this word ] to character size, highlighting by a character color, and 
highlighting by a bold letter are made, highlighting mentioned above about the "easy" word 
when referring to drawing 3 - ******** - things are shown. 

[0065]Next, an example is shown and explained about the details of processing of the 
information retrieval performed at the information retrieval Management Department 200 which 
the information retrieval site 1 has. Drawing 10 is a flow chart which shows the contents of 
processing of the retrieval processing performed by the information management retrieval part 
200. First, the contents of processing of retrieval processing are explained along with the 
figures. 

[0066]ln S201, only when it is distinguished by the information retrieval section 210 whether 
the search formula in which the word which are the demand of information retrieval sent from 
the browser 30 and an object of that search is shown was received and this discriminated 
result is set to Yes, processing progresses to S202. In the information retrieval section 210, if a 
search formula is sent, the search formula is stored in the search formula storage 21 1 . 
[0067]ln S202, the sent search formula is analyzed by the information retrieval section 210, 
and the word which is a retrieval object is started from the search formula. Search of the entry 
of the index file 310 which used the search word as the key is performed by the information 
retrieval section 210 in the turn that the search word was started S203. 
[0068]ln S204, as a result of search at a front step, it is distinguished by the information 
retrieval section 21 0 whether the entry which is in agreement with a search word was 
discovered, if the result of this distinction becomes in Yes, processing will progress to S205, 
and if No becomes, processing will progress to S207. In S206 which all of position information, 
an abstract, and an attribute flag are acquired from the retrieval record in which the entry which 
is in agreement with a search word was contained by the information retrieval section 210, and 
continues by it in S205, The record which consists of an entry which is in agreement with this 
search word, and position information, an abstract and an attribute flag is stored in the search- 
results file 320 by the information retrieval section 210. 

[0069]lt is distinguished [ which was mentioned above about all the search words started in 
S207 by the processing of S202 mentioned above ] by the information retrieval section 210 
whether processing of search of S203 was performed, If the result of this distinction becomes 
in Yes, processing will progress to S208, if the result of this distinction becomes on the other 
hand in No, processing will return to S203 and processing mentioned above about the search 
word in which processing of search is not yet performed will be performed. 
[0070]The position information applicable to all the search words started by the processing of 
S202 mentioned above among the position information stored in the search-results file 320 in 
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S208 here, That is, the position information included common to all the records of the search- 
results file 320 is extracted from the search-results file 320 by the search-results Management 
Department 220 with an abstract. 

[0071]ln S209. [ whether processing at a front step was able to extract position information, 
and ] That is, it is distinguished by the search-results Management Department 220 whether 
the position information included common to all the records of the search-results file 320 
existed, if the result of this distinction becomes in Yes, processing will progress to S210, and if 
No becomes, processing will progress to S213. 

[0072]ln S210, the position information extracted by the processing of S208 mentioned above 
and the abstract which is matched and is stored in the position information in the search- 
results file 320 are stored in the search-results list file 330 by the search-results Management 
Department 220. 

[0073]The number of the attribute flag which is matched and is stored in the position 
information extracted by the processing of S208 mentioned above in the search-results file 320 
in S21 1 is calculated by the search-results Management Department 220 for every position 
information of the, This counting result is stored in the search-results list file 330 as an attribute 
point size. 

[0074]ln S212, the search-results list file 330 is sorted by the search-results Management 
Department 220 so that it may become descending of the enumerated data of the attribute 
point calculated by the front step. In S214 which the HTML file which expresses the contents of 
the search-results list after sorting by a Web page in S213 based on the search-results list file 
330 is created by the HTML preparing part 410, and follows, The created HTML file addresses 
to the browser 30 which is the transmitting origin of the search formula mentioned above by 
the WWW server part 400, and is sent out to it, and this retrieval processing is completed. 
[0075]Processing to the above is retrieval processing. Next, the case where what was shown 
in drawin g 9 as the index file 310 is stored in the information database Management 
Department 300 is made into an example, and this retrieval processing is explained. First, if a 
search formula is sent from the browser 30, the result of distinction of S201 will serve as Yes, 
and logging of a search word will be performed in S202 continuing. Here, each word of the 
"hamburger" and the "tomato" should be started as a search word as a result of processing of 
S202. 

[0076]lf logging of a search word is completed, processing will progress to S203, first, search 
of a search word "hamburger" is performed about the entry of the index file 310, and the record 
about the entry "hamburger" in drawing 9 is discovered. Therefore, the result of the 
discrimination processing of S204 serves as Yes, and processing follows it to S205. 
[0077]ln S205, all of position information, an abstract, and an attribute flag are acquired from 
the discovered record, and the record which consists of the position information, an abstract, 
and an attribute flag in S206 continuing is stored in the search-results file 320. Then, although 
discrimination processing in S207 is performed, since processing of search of S203 is not yet 
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performed about the inside "tomato" of the search word started by the processing of S202 
mentioned above, the result of the discrimination processing of S207 serves as No, and 
processing returns to S203. 

[0078]Henceforth, the same processing as the search word "hamburger" mentioned above 
about the search word "tomato" is performed, The record which the record about an entry 
"tomato" is discovered from the index file 310 shown in drawing 9 , and consists of the position 
information, character string, and the link flag and the search word "tomato" in the record is 
stored in the search-results file 320. 

[0079]The contents of the search-results file 320 generated by processing to the above are 
shown in drawing 1 1 . After the search-results file 320 shown in this drawing 11 is generated, 
the result of the discrimination processing of S207 serves as No, and processing progresses to 
S208. 

[0080]Extraction of the position information included common to all the records of the search- 
results file 320 in S208 is performed, as a result - three, "the document 1", the "document 2", 
and the "document 3", are extracted here as position information included common to a 
"hamburger" and both the records of a "tomato" - ** it carries out. Therefore, the result of 
the discrimination processing of Scontinuing 209 serves as Yes, and processing progresses to 
S210. 

[0081]Three position information extracted in S210, "the document 1", the "document 2", and 
the "document 3", In S21 1 which the character string which is matched and is stored in the 
position information in the search-results file 320 is stored in the search-results list file 330, 
and continues, The number of the attribute flag which is matched and is stored in each of the 
extracted position information "document 1", the "document 2", and the "document 3" is 
calculated, respectively, and the counting result is stored in the search-results list file 330 as 
an attribute point size. 

[0082] Drawing 12 is explained here. The figure shows the contents of the search-results list file 
330, and what is shown in the figure (a) is created as the search-results list file 330 by the 
processing to S211 mentioned above. In the search-results file 320 shown in drawin g 1 1 , since 
a total of six attribute flags about "the document 1" are stored, the attribute point size about the 
position information "document 1" in the search-results list file 330 shown in drawing 12 (a) is 
set to "6." 

[0083]From the search-results file 320 same also about the attribute point of the "document 2" 
and the "document 3" and shown in drawing 1 1 . The attribute point size about the "document 
2" in the search-results list file 330 shown in drawin g 12 (a) is set to "0", and the attribute point 
size about "the document 3" is set to "3." 

[0084]ln S212 continuing, creation of the search-results list file 330 which shows draw ing 12 
(a) the contents by processing to S21 1 mentioned above will perform sorting of the search- 
results list file 330 so that it may become descending of an attribute point size value. The result 
to which sorting based on an attribute point size was carried out to the search-results list file 
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330 of dr awing 12 (a) is shown in drawing 12 (b), and the turn of each line is rearranged in 
order of the "document 1" with a high attribute point, the "document 3", and the "document 2." 
[0085]Then, in S213, the HTML file which the HTML file which expresses the contents of the 
search-results list file 330 to which sorting was performed like drawing 12 (b) by a Web page 
was created, and was created in S214 continuing is sent out, and this retrieval processing is 
completed. 

[0086]The example of a screen of the Web page which shows the result of information retrieval 
displayed when the created HTML file is perused by the browser 30 is shown in drawing 1 3. . 
Are preferentially arranged from what it is the search results about the word of a "hamburger" 
and a "tomato", and is expected to acquire the information that importance is high in the 
screen shown in the figure. The hyperlink to Web page 20 respectively shown by that position 
information is embedded at the position information on "the document 1", the "document 3", 
and the "document 2", and facilities are given to the user of these search results. 
[0087]ln the analysis of the HTML sauce about Web page 20 in the embodiment described by 
the above, although it has judged whether the character string is emphasized in Web page 20 
based on description of the <FONT> tag and the <B> tag, Based on description of other tags, 
it may be made to perform this judgment. As an example of a tag employable as the judgment 
of this emphasis, <l> on which a character string is displayed with an italic character The <U> 
tag which gives an underline to a tag or a character string, Or at a standard browser, there are 
a <STRONG> tag etc. which can enable it to make the character string able to pronounce 
strongly by the voice browser which reads out with a sound the text indicated to the Web page 
of that on which a character string is only only displayed in a bold letter character. Based on 
the <FONT> tag in which the FACE attribute for specifying the kind of font used for the display 
of a character string is specified, It may be made for the character string as which a different 
font from that by which normal use is carried out to the display sake of the Web page is 
specified to judge with what is emphasized in the Web page. 

[0088]The control program for making it carry out to the computer which has the standard 
composition which mentioned above the index production processing and retrieval processing 
which the information site 1 was performing in the embodiment of explained this invention by 
the above, and the same processing is created, By making the control program read into the 
computer, and performing it, this invention can be carried out by such computer. 
[0089]lt is also possible to carry out this invention by computer by making such a control 
program record on the recording medium which can be read by computer, making the program 
read from a recording medium to a computer, and performing it. The example of the possible 
recording medium of reading the control program made to record by computer is shown in 
drawing 14 . The memory storage 502, such as ROM with which the computer 501 is equipped 
as built-in or external attachment as a recording medium, for example as shown in the figure, 
and a hard disk drive, or a flexible disk, Portable good signifier recording-media 503 grades, 
such as MO (magneto-optical disc), CD-ROM, and DVD-ROM, can be used. A recording 
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medium may be the memory storage 506 which is connected with the computer 501 via the 
network 504 and with which the computer which functions as the program server 505 is 
provided. In this case, the transmission signal acquired by modulating a subcarrier with the 
data signal expressing a control program, The control program concerned can be executed 
now by making it transmit through the network 504 which is a transmission medium from the 
program server 5055, restoring to the transmission signal received by computer 501 , and 
reproducing a control program. 
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1 This document has been translated by computer. So the translation may not reflect the 
original precisely. 

2.**** shows the word which can not be translated. 
3.ln the drawings, any words are not translated. 



[Brief Description of the Drawings] 

[Dr awing 1]The information retrieval site which carries out this invention is a figure showing the 
entire configuration of communication net TEWAKU which provides an information search 
service. 

[Dr awing 2] lt is a figure showing the detailed composition of an information retrieval site. 
[ Drawing 3]lt is a figure showing an example of a Web page. 

[Drawing 4]lt is a flow chart which shows the contents of processing of word index production 
processing. 

[Dra wing 5] It is a figure showing the example of a HTML filter table. 

[Drawing 6] lt is a flow chart which shows the contents of processing of the standard setting 

processing of an index attribute. 

[Dr awing 7]lt is a flow chart which shows the contents of processing of the setting processing 
of an index attribute. 

[Drawing 8]lt is a figure explaining the setting processing of an index attribute. 
[Dr awing 9]lt is a figure showing the data structure of an index file. 

[Drawing 10]lt is a flow chart which shows the contents of processing of retrieval processing. 

[Drawing 11]lt is a figure showing the example of an index result file. 

[ Drawing 1 2]lt is a figure showing the situation of sorting of an index result list file. 

[Dra wing 13] lt is a figure showing the example of a screen of the Web page which shows the 

result of information retrieval. 

[Drawing 14] It is a figure showing the example of the possible recording medium of reading the 

program made to record by computer. 

[Description of Notations] 

1 Information retrieval site 

2a, 2b, and 2c and 2d Offer-of-information site 

3a, 3b user terminal 

4 Internet 
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20, 20a, 20b, 20c, 20d Web page 

30, 30a, and 30b Browser 

100 Research and Data Processing Department 

110 Web page collection Management Department 

120 Index build Management Department 

121 Web page analyzing parts 

122 HTML filter table 

123 Word extraction Management Department 

124 Character emphasis analyzing parts 

125 Character string analyzing parts 

126 Index registering part 

200 Information retrieval Management Department 

210 Information retrieval section 

21 1 Search formula storage 

220 Search-results Management Department 

300 Database manager 

310 Index file 

320 Search-results file 

330 Search-results list file 

400 WWW server part 

410 HTML preparing part 

501 Computer 

502 and 506 Memory storage 

503 Portable good signifier recording media 

504 Network 

505 Program server 
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mmmcrMM] 

izth -xmm mmz^immmt ^nmwfx^i, 

T^&messMM&k znmttWTmmwyT'f mz 

mimmnmzm-t&mizm^Tmm^y r a iw> 
mm* n% -5 x mxm&tt&ttvbtix i * 4 ^aws^ 

[ If *H 2 ] iff ESSHfifSl fc£ A^-T ^ I. ifc^J fc^- 
*. 6 ftT v ^4 JRffii , RSrt^d £ 0txr? 4 k S tffl v ^ 

•$"4 fcibfcffl^ £» *i4 S^n^M * jjrT JUST* 4 i i: 

twafc-r***^ i izimcommmm^^^. 

mz^x&w&mwmiz&n&tmmmzmm&mz 
w^htammMMiH^k . 
mimmmrnmmm&x'fo&^frzmwi-t&mmz 
wmmt<r>\mmmzm-3\ ixm.%?? 4 

K # S ftT ^ 4 ISC^J C # x. h tiX V ^ 4 SHfeWRX^I 

t^4 # tea, mjR&&ffi&amm&Tf>z>k.&&t 
z k timt-ttmcm i fci3»ofli«sBR^^-r a. 
[ if *ii 5 ] ffirie«*¥Stt , fraeawt^afc =t -?t 

Kf#£ tifcfiB'Iff^ 5 *> s ffisEffiM? r A MZH \ >X 

im^Kx^&Mimmm^mw^hcr>azm_jtL 



xmmmmitmTK-tz, z t zwm t -rm im i tcts 

[11*316 ] Ii^7 hV-^±T'^ii§iiTV^4^: 

JEE LX V > 4 flailiiftiSti^* -5 T . 
B9ISS*W ISt#4^TV^45:^J^x. ^tiX v^m 

Kfiaif m k ws&eomftToioj&m i,z$-tt>tix^& 

mtmmz£^xm%izft.tdm.mmn o *> , mmm 
?t4 Mzte^xmimmm^mm^ttMik Lt^mmz 
mmmm&&nmttift>ixx^& h<nzm%hxw& 

z k zmLb^&imm&jm. 

[fM<il7] zi>t:jL-?lzm'€tSit&Zktz]:K). 

x v * 4 mis k tmm* -yhv-? ^^mmmtc&m^ 
-tmrnxfo -y ximm^tsmmim^-t^mmmn 

g£^figffifg££ttJ£WtT&4i8l7T^£ffl 

m-t&mwk. tmcoM^m-tmimzm^\^xtm3\ 

4fir«fffB£^-r4^t Srt^ytfA-^tcfl&fc 

MiB^?ijj&^#^£ttm-r4^ai: . 

tees+f fg t R*s^)»iai7D^s:^!ifc:#-t ^^-c v ^4 

4^at. 

mmmiz£->xm.&%iiti®MmmcDo-ib, mmm 
yrA MzB^xmiiLS.mmcr>^mioMm.k ttzm&z 
m mmmmmmnif ^tcx ^ * t t TKffl 

[0001] 
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4. 

[0002] 

y^_^.. y h_hcOWWW (World WideWeb) yXfAt 

ax ^ 4 w e b 'jcomitm^mzmtmifx 
iiy 5^y^iHRiii»s*tT v^4 . 

[0003] iixyy*y*;^ 7 b±«tffgSr)Rft-f-|» 
H&eoifik^k Ltn^ 7 NM£lfc£;fiT^4 

mmZfiX. A hl.T&mZtlX^&W eb 

s<— y^^U^SHTML (HyperText Markup Lan 

guage ) y t a jv^mrnMmi^hKh , imwtt& 

f«We b^-yCoHTcO-f -y bUZ&fh 

mmtzmmztiz . 

[0004] 

•CCO^T cr>mm £ >- 1 a - * T S »&<J fcfr =Sf r> T 33 
0 . * ifcUiAIB^HBft J: 4«fg«»fW^L^ 
<?)X\ JRIfeSiifcfllflkoK-f * ^> y^^mmcoM 

mcomzmph^-v- K£>— mz ± s^s-tf^-^T 

X, -f-^-^^^rffltt^fftV vfflfSg L fit H^UWe 
b ^- s^f *3 tt3Rt&K&£ < -g^T L £ £ 9 "T 4 

[0005] vx±.<Dmm^m^, iix>/yA«t 
4 ft #^sgoisn ^mmmmmcomm a wt*t t-c i t> 

X*$> 4 . 
[0006] 

- ? ±t-^w ^it v ^ ffifBt* * tzx v ^ mil 
t. tmm^-y bv-7 ±cDmmtt&wz^mmT 



xmmmtt^m^mmzttfcLx^&tmmMZiSiTr, 

[0007] fU, 4^0JB««tAtoTftSflH8 

iBR^i^r AMzmkthmk^fkk , ma&mm& 
m-tmmzm^ ^x mi a*? 1 7 r 4 /i^xjbr £ tf =&■ t 

fifcffififilffio 0 *> . mrlE^ 1 7 r A Mz& v ^TfS^ 
[0 0 08] £iT, Mia^SffifB^^iX-CV^^ 

asN s tit ^ £ fcftiitiTO ^ * i 6 n s Jcmmm 

cor. flBISBRWlft**** IB«BR*0«5(IB WtcM LT 
[0009] ±aLfc*»BBfc:ffiftfll««BR^ 
iz^itfix^h^mz5-^c?nx^hmmz~o^x<7) 
. tfriawtt^ifriasiiiWttT"^-!.*^ 1 

mmiz-stttx ^zx^mz^-t httx^h mmmm 

[0010] ^WfiScfcifUf , SSMffBtiJ^Tl^P 
SilfHS^T V^4 fc i i: ^T'S 4 J: 3 tc^4 CD 

t\ ; <os:^ic*4 m&z t-?xz <D^cmm mz. 
mmmmv^cnt vxm5\? r a Mzmmx-% 
iz^h, 

[0011] WaLfc^BBfcffiftflHBKBR^^ 

tz^tnx^h^miz^thnx^&m&mzx^m 
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Tv^fcstii, mm&zmimmmmxfo&bfrtt 

[0012] -rcOflffiStCifttf, S»'fffgfcfe^T^ 

Sit & i 5 fcfc £ . ± , Hfi* Lfc*3MHfc:« 
SflWMWl^'rAfctJV^T, fulBiS^SJi. m§m 

fc Ofl fc'flKfc LTKfiHHff J: 3 fc LT fc J: 

[0013] -roffi^i^Hr. fcS^M^^^ix 

ft* *stc i: o t i w jaMtfWiaa** 1 ± o w ^ t « 

b Lxm3\y r A Mzm§T% £ X 3 0 , flfflitfeR 

k&&. 

[0014] *w%m\<mm.wbr>xfohwmm 
-tm±, mmjtmmmiz^ttiz^&x^mzif-t t>ti 

l§tc , R*Bt:o v ^T^fulBfiSffi^ t flMcoff&ttJTU 

tofEJga i ^ r 4* l , fris^^r^^^mist^ 

[ 0 0 1 5 ] «r*j, J^Lfc3f^t:fllSfl|f|«BtttF3fe 

tSltf i k t ioTKTJ* LfcWH*)!ISIi't4 £ k 
[0016] 

[ 0 0 1 7 ] HI fcfcwc, tffg^^-f b 1 , flHHI 
f£+Mh2a, 2b, 2c, 2 d , Wf :*-— f Wt. 3 
a. 3 bi4^-r^xtafi*>y hV-^-Cfel.^y^-^ 

Z. b ifiX% h . fflfSftfftiSlM h 1 1±3-— ifSB*3 a Rtf 

wwt-Ayxfiti o , mmm ioo, AHRtt 



3§^aas2 o o , ffimf-f^-xwms oo, 

[0018] titmmmi oo(M^-*7h4±c 

$a^w$fi-f-^^-^«agP3 oocuwi. 
if®2 0 0ii y^-^-y h4iiYtxm^ix.x 

<&ffimmm<vmmzmt, mm^~^^-^mmu3 
ooiz^mztix^&mmcDmmift^K %<nmm<r> 

[0019] mkT-?^-z. f mm3 o o timnit 

Sgp 1 0 0 J: ^THXm§^Wfg<75»». &tA«|§^ 
^tagB2 0 Oti4flH8««OR36«ff*ixh.4. www 

< &jRA$^flnB»fimni^ias 1 o o tug^-rsM 

a. -f^-*7 b4^^LTiM^fLT<4fflffg^<?) 
S*Sr«$S^^aJ2 0 OtciEjU-rSffiil, fttXflBR 
«SSR«3aSP2 0 0^^>il^ixT< iflMiWR^ttA£^ 

-tmrntm^tix nsweb^- ^jim^s^fi 1 * 

[00 20] flHB»flt^^ N2a, 2b, 2c, &Z/2 
dii, ^-fL-TfLWe b-^-i/2 0 a, 2 0b, 2 0c, 
&IK2 0 dSr-f h4±-C^ffi-T4WWW-9-- 
AyXfi^tAI.. ^rfe, HlfciJ\(^«4-p<DflBHi 

[00 2 1 ] JL— rSH*3 aaV3bli, -til-WlflHB 
Itffi^-Y b2a, 2b, 2c,2S.y ; '2 d^>1f f8«6S^>f 
M3&»6a*SfL4Web^-^**IBBM-6y7b^x 
n-fcS/^^f 3 0 a.Rt/3 0 b SrUff'Brtg^ 3 y b° 
A-i'T'J £ > i 0,-f>'^-^-)'b4±T'^ra§tl.TV'>|,ffl 

-oxm^tih. 2r*3, 01 HiJ^Tti 2o<^i— 
^LXV^htK Jyf-^-y h4t^§tl.£J--1f3g 

[00 22 ] fctJ, itl^^+ffffi^^-^ h 1 , fllWi 
ffi-f b 2 a , 2b, 2c, MS 2 d , JL— <fS*3 a 

KR*W»i-« C P U t , ROM^RAMS.^'MmiB'H 
iaKrifa^SrO, CPUt4M*rt^lft««iS*4«I 

MW-r - ^ *qRf§3 a £ A^lSE fc , -r -f XT V A % b' 

#a<50r-^^^ttJL-Hftii^rr^a}^gpi:, ^ 
■yyv-7\zmm-htdsbcoA y^7i-»m 
-r s i / f a5 * fit x h 3 y e * - 9 % m v ^-c «xw s 

rki>t'&£. 

[00 23 ] iJcfcH2teovvOKi5H-4. RHtt^BJ 
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iSl/C^S. H2£SrTJ:ofc, 1 0 0 HW 

e b^-yjRswagpi i o&xfmsi&s&Mi 2 0 &m 
txm^ti. mwsmmmm2 0 oi$<mmtm2 1 
o&tx«es*ss«asu 2 2 0 %ffiz_xwf&ztx. t-? 
K-x^mms 0 oim3\?T 4>i-3 1 0 . tanaes? 

r^f/L'320, St/»iii l JXb7r'f^330!rI 
£T«J&£;f"U f UWWWt-A'4 0 OliHTMLff 

1 O^Iit«§^^, 
[ 0 0 2 4 ] We b^-^'IRmi : agPl 1 0{±-O^- 
^-•y h4±T'^HH$tLTV^We b ^- V 2 0 CO g »JR 
JfcfcjeKBWKfPSro . Sf&lffr&Vaffi 12 0(1 Web 
^-^'iRmVUgP 1 1 OteioTJRJfcSiUfcWeb^- 

?Kifc<^t**4lligii:5r*l'3-HSffiKL.TSRg[7 
r4)V3 1 OUzSMth. Sratf&ftWBSSl 2 0liWe 
b^-^'JBWaS 12 1, ^Bttm^IfS 12 3, §g?IS 

[ 0 0 2 5 ] We b^-^'WtlfSPl 2 lfiWe b^—'J 
2 OWjSff^otWe b^-S/2 0 fc^H/C^-S 
HTML7r-f /KO^fclB^ $ilT^S#HTML^ 
u r?— K coJp.fi £ -f § H TM L7-f /l^x— 7)V 1 

2 2 . #isffim«agp 123m html 

7 4 JVf'r— 7*)V\ 2 2\z^tiX\^h^C¥n^ii\^X 

ftl^^We b^.— i?2 ocr>mmk txm^^titzb^ 
h wmpse@msmm 1 2 4 T-fi^^u t tz . ht 

ML7^1/^f-7'/H 2 2iZ^ixX^^6^ncom 

[00 26] 1 2 6 il ^^J«f/f^ 1 2 5 

We b-^.— VI OtCOWCCO-'f >?—^ y h4±lZ 

tui&mmwx&wmmt , ^comm^tix-a^w 

eb^-i>'2 0«l*)i;, Web^-y2 0tWI 
t3|S^§^T^^«^^'^3itffljSWM 1 2 4 T^jWf 

izi. ^xm^tifzx^n^com^z^-ti, z\tz^-t 

M&7 ?7 b £ *ft6ftl*fc*gi 1^3 - F £§g?l 7 r 4 ^ 

3 1 0 fciMW* . 

[0 0 2 7] 1f#^gP2 1 Oil -f V*— **b4fc 

fix ^4 1 ^-m^(DJ-—^^x'mf^tix v 

77W3 0 Offlff (c J: -> T -too.— ^JKQ&»6iSfefL 
T< 4flHRl««S$*WW1JW'qB4 0 oa»iS>JR# 

DtHLT^^g«gP2 1 1 f Lt, §S3I 

umLxmm^y t^;i/3 2o ttgtt-f-^, . 
[0028] nautm&m 2 2 0 a. tt fB«s?a» 2 1 

0 fc J: SKaW^TT* t . BOBS*? r^f^3 2 0C 



mmb. % corns x-zcomwftmizttifc l t ft 

§ tiX v R14 7 v 70>&tm b £ mmm% Vxh7r 
•f;i/3 3 0tratl», L-T„ ^m^S'J^.hT'T^ 
^3 3 0 t^S*LfcffiKflWB*-&§f»(c«^ , Cy- h 

[00 29] HTMLfft£St4 1 OJi, iii*'JX h 

77^^330 itzmm znx^&v-hznfz &mmm 
mm&m vxv^im-oxz commune u x 

bfimMZtl&We b^-y&gSnHTML7T^( 
/Pfc ffJ^-T « . f£#t§ titz HTML7r / f Mt7"y 
3 0 ^Hfr § ftT o •> 6 i-f'S*^ttWWWf-y s; 
§E4 0 0Ci^^-*yb4tjiaj$iriS, 
[0030]*):, flWB8BJW>r h l ey&thffltfm 
Ml 0 OtCijV^Tff^^tLS, We b^—iScoHm&lS 

m^co^commcommiz^xmm-t^. H3ii, 

HtCfeV^T, (b) tStHTMLy-A^y^^fS 
0 i -?XfflWZtl& kHi(a) {C^-THHj&^^S 

[00 3 1 ] iiTH4(coi,^T»Wr4. RHHflm 

Web ^-s/<J5JRmZtVa5l<^^3& t fflfffi^l 
Ml 0 OT'lf^^So S-Ts S 1 0 1 t&WC, igft 
<7)H#*V ^f^^iXTV^We b^.-^'2 0COJK* 

a -cfc i. 3W53&»3&rw eb^- i-'WMtmm not 

[00 32] SI 02TIW >?-*-vh4±.T-£:ffl$ 
fltUSWe b^— i?2 0C03SHI • iRS^H^We b 

[00 33 ] S 1 0 3 Tit. We b^-^'J$WSP 121 

20S1 ■^.-> ? -fofiL^-r^46co*f^ y^xhh^- 

i>'#4 b Lxmmzti&^m.mizmm 1 

tL-S, S 1 0 4T1±S 1 0 2(D%mtZ£^xmkZtlX: 
We b^.~-J2 Ol/Ztiif^—^-f >7mC0i%]±C0W. 
X^Lm^tth ^- i>^ffijt^W eb^- ^tWf* 1 2 
lCi-oTHWrStl, KS10 5CS^tHTML7 
<)V9t—7")V 1 2 2*iWe b^—i/ffiffim 12 1CJ: 
-oX±f&%tL&„ 

[0 0 34] H3tiUcWe b-"<— i^j&^ifRS^S 
HTML7 ^/P^x— fcSjrtf". Web^-y 

BWfafl 2 lfcj:-jtEI3 ( b ) t^LfcHTMLV- 
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tf>&$LZfi&« S104^W§(:^^tEI3 (b) 
L%tft>mzMW-t& LWeb ^-iSMffi Sfl 1 

2 K11 |?tfM^£7)HTMLy-^(7)*5:. 

< ; B0DY> ; COTM-h? 7 k **T ^ h «ra1tCi*4 ilT I ^Sgfrfr 

[ 0 0 3 5 ] 03 ( b ) (;S1"HTMLy-Xi^S5 

WCiftBJ3-r.S> t , HTMLy-Xtfc(«*tffl 

1 E^lfB^TS) 4 < ; B0DY> ; ? ?"«IB#i ? y k m~7 9 ? k com 
IzmttlX^&ffift, -r^*><;B0DY>;^^t<;/B0DY>; 9 
7'klzmtilX^h^fttfS 1 0 4<03®HtJ:4J»W<0 

[0 0 3 6] iiT, W*f«*^)Saf*KtSlt4« 
30«< ; BR> ; ^ ^aWEiE § *LT V ^ ffiW £ TWgfcfr . "t**) 
*> s r <;F0NT SIZE="6"COLOR="#FF0000">;<;B>;ffiW4iI<;/ 
B>;<;/F0NT>; j &&$Mbfflm8rZtll . IiT\ <;F0NT SI 
ZE="6"C0L0R="#FF0000">;^^^^14, <;F0NT>; cr>m%i?7' 

flfflij **S?J9!lfc:oV^T» :S?JMMX£ r 6j 
&I>*S S t Lfio;fc^fe£ r # f F 0 0 0 0 j 
mx-^Kh-^kLxm^-thZk^LX^h, % 

a , ; mmx'^tih&im&xfo h . 

[0 0 3 7] ttz, <;B>; <;F0NT>;<JO^^^ 
US . S 1 0 4 (c J; £ «W®aii±a L fcWWSrfi 1 ^ -3 

i> cnxti *) . zcomzmi^tti s i o 5 commz x -> 

m J Ji^t^WCft SiiiJSt TSTRINGj ftffgtt 
otvf ^?t^fXj Sr T6j , r&i £ kLX 

r «£'Jj A* r BRj X'hhZ\k(7)^^k-th. <;BR>; 

2 2<75fl52frBfc*llftS*l4 . 

[0 0 38] H3 (b) HTMLV-XC0 

<T 3 Ltl5(:StHTML^-y'7^^^ 1 2 2« 

^"-r^k'k^o xoiz, mwnom^^^i xt*tr 



[00 39] 04 COIMBJ^M 0 . S 1 0 6tli:HTML 

m^tix ^i, k^-^h^mm^mm^mmi-x 
*<rm*x^*9m&$cmmmm 1 2 4 ■etrsr 

[0040] S 1 0 7 -CliS^JWfW 1 2 5 K i 
HTM L 7 ^J^f -7IH 2 2 C0#ff ( 1/ 3- K ) 

Jt#ttAkoi*oJB£^-4fcKxosK-< >^T"ft 4 7 <r ;P 
#1S„ S108T1i. Ji^Lf,:^ y^nl:± 

-3 t ^tihmz^tix^h 3c¥¥\<?mm z ^-rr 

-MHSTRINGj T'ftS^S^^?W?ffSl 
2 5 1 «t oTWJSiu i^BflraBtSfe^Y e s^r^if 
S 1 0 9 teJSa#3tt», No^li'S115 

tr. 

[004 1 ] SI 0 9TI1 7^;^?K-<y^ntc«J:-3 
T^^fLSfft^SfLTV^^^lJtC, S 1 0 

ijuBawx^wwraj 1 2 5 1 i ^Tff^^tis . 
Mai<50paii>f*jdit- 4. ^ofiL s 1 1 ofcfcwc?* 

■ox'tthti. si 1 lfcfcv-i-c. zcmm&nntzm 
iu> $> zcn&m-tfzmx'fo &<mimm-t&mmtf-%¥ 
nmmi 2 5^^-oxm^xn^hti^. 

[0042] ^-i3. s 1 1 OlZWt&mmnWOliiLO 

mexm k -im&z ^-m^mm zm^x * ^m^mm 
mis t -t* ^^ijou 0 as l *%<r>%sm<r>9t 

[ 0 0 4 3 ] s 1 1 2 Tit. AM Ltz s 1 1 1 commz 

12 5 iTfUIS^ iO¥'J^SS**Y e s ^rfetf 
S 1 1 3t»^jt^ No^tTSll 5t3E0a* J jt 
ti. Sll 3Tfi. Mi*L/iS 1 1 lWWItiotft 

58»k, ^com^^t.tiXi^zJc^mzMLXSi o 
9 ««yst: J: -5 •£ KSe$*ufc:S??JBtt i:Sr-e ^Sffi 

KS11 4 0^iItcJ;^-r4iJi!c$^^IA i ^l7T 

•op 3 1 otceii§^i> 0 

[ 0 0 4 4 ] S 1 1 5 Jt*^J»Wg|3 12 5tJ:-3 
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xmm Ltzy 4 )v?xa nosij*** i tsmnhhti 

S„ S 1 1 7 4f\r9-KA y?n03g&0&MtZ 

i-5t^§tlliffi s HTML7-f;^f-/;H 2 2 

W9f 1 2 5 1 i o T WS'J § ft . £ <0 fMBSa** Y esi 
£>{£M^S 1 1 7fc3tt», Noif,lfSl 0 8^ 

[0045] SI 1 7T"i4. We b^.— ^HWgE 1 2 1 

£>ft& 0 s l l 8til ^—VKA v?mam&<nmM 
lz X ~> Tf£5££ft S^- S^'W ebi- ^'JKm^agP 1 
1 OtCi-^TItXmSft/^We b^— >-'2 0c7)ftHcO^— 
i^jg^T L^"jfc*WWe b^- >?WP5fgp 1 2 1 

No^yfS104 ' v sJ8S«#BS T ±3* L ^XPE^itt 0 
iE$ftl>o 

[0046] J^ircoaSHsWfSl^ftJSl-C** . & 
t» ±3*Ufc*gi*JiE«Bafc:*iitft S 1 0 6^fMt t 
TX&m&m&l 2 4T"fT^^ftS^?[«e«S£»IS 

[0047] ii\ SI 2 lfcriJWC, HTML7^;l/ 
^f-7>l 2 2 CfcftS r «5j'Jj * s r STR I NGj 

wc. &x¥*rA xcotamm. t^HTML7^/i- 

[0 048] S123T11 WISSSftTV^lt^ 
-f XfcOl cOX*PtfJ3lip S ^TX^# $ ft 5 0 S124T 

ti . s l 2 2 coast i -5 t JKfJ Sftfc ass****^ 

^XO^g^JIgtJgfttJIJSSft, tt<S125fctiV> 
T , * <7)JHfHI#*»WB S £ Jim o a»Sta5*MJ}!I 3 ft 
4 . * LT , i^JRBMftft&JY est^^il t®a 
1 2 6(:». — S\ iiOHBBftR^NoTftiia 
t±, S 1 24«JS11jWI§9 3E3ft£, 
[0049] S126TI1 IffX^ >y TCD»Ma<7)iS§ 
S^Y es^-)ttl IZ^COWMCO S 1 2 4 

i<0Sm*^if-f XEsiz e ELttftt^-f XfOKtt 
CO^SftTV^^^iJ^'. We b^<— >>'2 OttiV^T 
^iW^^fT^-sTV^^^ijT'feSt^^ftSo 
[0 0 5 0] S127T14. HTML7^/^f-^ 

1 2 i\,zavfh r «syj # r str i ngj tj^ffc 

^^ftTV^^T^^iJcO^a^It*^ r^ff^ij 
tfctf&^fetetlTaiSftS. S 1 2 8T11 



1 2.2\ l z^fc^^h^x<^x^¥\<mcm.\ l zm-h%- 

^C¥^<ryX'm.<rM\^m& £ ft £ . 
[00 5 1 ] SI 29TI1 "FtoRJESft-O^^Sffe 

iz^^xcommmmmc^mn^tih, s 1 30-m, 

S 1 3 1 iZ&^X , i^S^-feC n^lffeE c o 

i orb LTf3£§ft!>, gB^ssi&ifcsaszN&g 
mmzis^xiz. ;«»feEc o 1 ortaiwt 

'f&COm'&COm.MZilX^&JC^W. We b^-^'2 

o is \ SSiSa^S' ftSr r>X^%> -%¥5\Xhh t »5r 
[0052] S 1 3 O&t/S 1 3 1 «^a£*$;t. Ski 

corns im&miBt&ti&mtM/n' l . mm l £ a 4 ^ t 

[ 0 0 5 3 ] 08 ( A ) (i, 1 ^rSWe b*<—zslZ 
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